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FOREWORD 

If there be any lingering doubts as to the wisdom of doing deeply funda- 
mental research in an industrial laboratory, this book should dissipate 
them. Dr. Shockley's purpose has been to set down an account of the 
current understanding of semiconductors, an understanding which inci- 
dentally is comprised in no mean degree of his own personal contributions. 
But he has done more than this. He has furnished us with a documented 
object lesson. For in its scope and detail this work is obviously a product 
of the power and resourcefulness of the collaborative industrial group of 
talented physicists, chemists, meta'lurgists and engineers with whom he is 
associated. And it is an almost trite example of how research directed at 
basic understanding of materials and their behavior, "pure" research if 
you will, sooner or later brings to the view of inventive minds engaged 
therein opportunities for producing valuable practical devices. 

The program of work which Dr. Shockley leads was aimed at under- 
standing a kind of materials, the semiconductors, which had already 
received considerable application in the communications business in the 
form of rectifiers, regulators and modulators. Not only were improvements 
in such devices hoped for but the possibility of creating an amplifier was 
envisioned. In the course of three years of intensive effort the amplifier 
has been realized by the invention of the device named the transistor. 

It would be unfair to imply that any and every fundamental research 
program may be expected to yield commercially valuable results in so short 
a time as has this work in the telephone laboratories. To achieve such 
results, careful choice of a ripe and promising field is prudent and a clear 
recognition of objectives certainly helps; but there should be no illusions 
about the necessity of a large measure of good luck. 

Solid state electronics, or transistor electronics as Dr. Shockley calls 
it, preceded and in one respect has always excelled vacuum electronics. 
This is true in communications engineering at least, for the crystal wireless 
detector preceded the audion and it is still the best detector when the going 
gets tough, as in microwaves. Nevertheless in the past forty years the 
vacuum tube is the tool which has shaped the whole electrical intelligence 
transmission art. It is an art traditionally based upon highly stable 
amplifiers in which distortion is rigorously suppressed and upon modulators 
in which distortion is precisely tailored to a useful purpose. Evidence is 
already strong that transistor devices will be developed having character- 
istics suitable for such exacting uses. 

vii 
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The newer methods of transmission by quantizing and time splitting, 
together with related fields such as electric computing, require large 
numbers of low power amplifiers and gating and flip-flop-circuits to handle 
pulses and stepwise current changes. Transistor devices are being found 
to have unique advantages in this type of circuitry. They are tiny, fast and 
efficient. Here the science of transistor electronics which Dr. Shockley 
and his colleagues have so effectively launched promises to lead into new 
areas of technology. The present volume should be a valuable guidebook. 

Ralph Bown, Director of Research 
Bell Telephone Laboratories 



PREFACE 

The hole, or deficit produced by removing an electron from the valence- 
bond structure of a crystal, is the chief reason for existence of this book. 
Although the hole and its negative counterpart, the excess electron, have 
been prominent in the theory of solids since the work of A. H. Wilson in 
1931, the announcement of the transistor in 1948 has given holes and 
electrons new technological significance. From the theoretical viewpoint, 
the hole is an abstraction from a much more complex situation and the 
achieving of this abstraction in a logical way appears inevitably to involve 
rather detailed quantum-mechanical considerations. From the experi- 
mental viewpoint, in contrast, the existence of holes and electrons as 
positive and negative carriers of current can be inferred directly by the 
experimental techniques of transistor electronics so that holes and electrons 
have acquired an operational reality in Bridgman's sense of the word. 
Furthermore, the new experiments have established the quantitative 
aspects of the behaviors of holes and electrons with sufficient accuracy 
for many of the purposes of transistor electronics. Thus, in the level of 
abstraction, there is a great difference between theory and experiment; 
this difference is reflected in the organization of the book. 

In Part I, only the simplest theoretical concepts are introduced and the 
main emphasis is laid upon interpretation in terms of experimental results. 
This material is intended to be accessible to electrical engineers or under- 
graduate physicists with no knowledge of quantum theory or wave me- 
chanics. It should serve as a basis for understanding the operation of 
transistor devices and for elementary design considerations. 

Part III, at the other extreme, is intended to show how fundamental 
quantum theory leads to the abstractions of holes and electrons. In order 
to make the spirit, if not the details, of these mathematical investigations 
accessible to readers without extensive training in theoretical physics, an 
introductory discussion has been prepared. This material, presented in 
Chapters 13 and 14, is intended to answer questions that frequently arise 
about wave functions and their interpretation. Chapter 14 makes con- 
siderable use of electrical engineering analogues and covers most of the 
basic quantum-mechanical principles needed for the subsequent treatment 
of holes and electrons. Part III also contains an introduction to statistical 
mechanics and other topics applicable to the theory of electronic conduction 
in crystals. ' 
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Part II attempts to bridge the gap between Parts I and III by presenting 
the reasoning and results of Part III in pictorial and descriptive terms. 
The closing chapter of Part II reexpresses some of the theory in analytic 
form useful for quantitative studies of transistor phenomena. This 
chapter has also been used as a repository for some late developments in 
transistor electronics. 

Problems follow many of the chapters. Some of these are simply 
numerical examples intended to give the reader a feeling for the orders of 
magnitude involved, others are intended to supplement or extend the 
mathematics in the text, and still others contain results which might well 
be discussed at length in the text but which have been stated as problems 
in the interests of brevity. 

The endeavor to probe deeply into the logical consequences of the funda- 
mental theory, to reduce these consequences to pictorial terms and to find 
experimental counterparts to the theoretical concepts is in keeping with 
the philosophy of research at Bell Telephone Laboratories. The invention 
of the transistor occurred in connection with a research program based on 
this philosophy and the development and content of this book reflect the 
same philosophy. The modus operandi of research programs like that 
associated with the invention of the transistor is to seek primarily for a 
fundamental understanding of the phenomena being investigated while at 
the same time remaining alert for possible applications. 

I have frequently found it helpful in my own thinking to consider that 
fundamental understanding for many solid state problems is achieved 
when four questions can be answered: 

(1) What are the atoms involved and how are they arranged? 

(2) How did this arrangement come into being? 

(3) How does this arrangement lead to certain mechanisms of electronic . 
and atomic motion ? 

(4) How do these mechanisms give rise to the observed properties ? 

In terms of these questions, the phenomena of conduction are understood 
and so is carrier injection by p-n junctions. Many other phenomena dis- 
cussed in this book are understood in so far as question (4) and part of 
question (3) are concerned; a completion of the understanding of some of 
these phenomena is currently a major research aim. 

This book had its origin in a series of lectures given at Bell Telephone 
Laboratories in connection with the growth of the transistor program. It 
thus owes its existence basically to the invention of the transistor by 
J. Bardeen and W. H. Brattain. Its content and organization have been 
influenced by expositional needs discovered during the lectures. The 
emphasis is accordingly on those materials and phenomena that are most 
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prominent in transistor electronics and numerous other semiconductors 
and effects are omitted entirely. 

The preparation of this book has required support in a variety of ways. 
The encouragement and constructive criticism of R. Bown, J. A. Morton, 
R. M. Ryder, W. G. Pfann, L. A. Meacham and J. J. Markham have been 
of great value and have resulted in particular in the arrangement of Part I, 
the existence of Chapter 14 and various improvements in Chapters 15 and 
17. For other assistance, I am indebted in particular to R. D. Heidenreich 
and also to J. Bardeen, G. C. Danielson, W. G. Dow, J. B. Johnson, C. 
Kittel, K. G. McKay, P. H. Miller, Miss D. J. Oxman, F. Seitz, Mrs. G. V. 
Smith, L. Tisza and many others. W. van Roosbroeck has commented 
on nearly the entire manuscript, has carried out the calculations for many 
of the figures in addition to undertaking responsibility for the index. The 
figures themselves have benefited greatly from the efforts of Sidney Lund 
and the late B. A. Clarke. The very tedious and difficult task of typing 
the several drafts has been admirably handled by our transcription depart- 
ment. The problem of organizing the work and keeping track of the 
many details has been solved for me by Mrs. E. M. Sparks. The encourage- 
ment and cooperation of Jean B. Shockley have been essential to the work. 

October 21, 1950 WlLLIAM SHOCKLEY 
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" : E = dE/dt. 
*: 6 Uy*-= complex conjugate of ^y. 
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CHAPTER 1 
THE BULK PROPERTIES OF SEMICONDUCTORS 

Between 1940 and 1950, the understanding of semiconductors was very 
greatly increased. During that decade a substantially complete picture 
was developed of the energy level schemes and motions of electrons 
in silicon and germanium. This understanding is an outgrowth of the 
research and development program on crystal rectifiers undertaken in con- 
nection with the radar program during the war and continued in several 
laboratories thereafter. Some of the wartime work was carried out in the 
Radiation Laboratory of M.I.T., which operated under the supervision of 
the National Defense Research Committee. The Radiation Laboratories 
Series volume Crystal Rectifiers 1 by H. C. Torrey and C. A. Whitmer reports 
this program and mentions in particular as chief contributors to crystal 
research and development in England: the General Electric Company, 
British Thompson-Houston Ltd., Telecommunications Research Establish- 
ment and Oxford University ; and in the United States : the Bell Telephone 
Laboratories, Westinghouse Research Laboratory, General Electric Com- 
pany, Sylvania Electric Products, Inc., and the E. I. du Pont de Nemours 
and Company. It is also pointed out that the crystal groups at the 
University of Pennsylvania and Purdue University, who operated under 
N.D.R.C. contracts, were responsible for much fundamental work. 

With the advent of the transistor, the role of semiconductors in elec- 
tronic technology has assumed much greater prominence. In addition, 
techniques developed in connection with the transistor program have led 
to new experimental information concerning the basic processes of elec- 
tronic conduction in semiconductors. Occupying a central position in 
these researches are germanium filaments and micromanipulators, like 
those shown in the frontispiece. These filaments play the same role as do 
the tubes in vacuum and gas discharge electronics. In fact a close analogy 
can be drawn between the science and application of conventional elec- 
tronics and those of the new electronics of transistors. We shall accord- 
ingly use the phrase transistor electronics to designate the field in semi- 
conductor physics that covers fundamental processes, the analysis and 
design of devices and the theory of their circuit applications. All of these 
subjects are treated to some degree in the following chapters. 
The semiconductors of interest in transistor electronics are electronic 

1 McGraw-Hill Book Company, 1948. 
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rather than ionic conductors. In ionic conductors a substantial fraction 
of the current is transported with an accompanying motion of the ions. 
Since the positions of the ions and atoms determine the physical structure 
of the material and its chemical composition, ionic conduction produces 
radical changes in the sample. In electronic conductors, however, the 
atoms stay in the same positions. They may lose or gain electrons during 
the conduction process, but the structure and chemical composition are 
unaffected. 

1 1 EXCESS ELECTRONS AND HOLES AS CURRENT 

CARRIERS 

Basic' to the theory of semiconductors is the idea that electrons can 
carry current in two distinguishable and distinctly different ways: ^ one 
being called by "excess conduction", "conduction by excess electrons , or 
simply "conduction by electrons" and the other being called "defect con- 
duction" or "conduction by holes". The possibility that these two proc- 
esses may be simultaneously and separably active in a semiconductor 
affords a basis for explaining transistor action. 

The quantum mechanical explanation of these processes in terms ot. 
"energy bands" and "Brillouin zones" is well developed and is one of the 
chief topics dealt with in the following chapters. In this introductory 
chapter, an elementary survey of the behavior of electrons in silicon and 
germanium will be presented. The general features discussed here are 
then explained on a more advanced level by the theory m the later 

chapters. , „ . 

Silicon and germanium form what are called covalent crystals the 
atoms being held together by "electron-pair bonds" formed by the valence 
electrons. Preparatory to considering the electronic structure of the 
crystals, we shall first describe the covalent bond in the hydrogen molecule, 
which has the simplest electron pair bond. Figure 1.1 represents two 
hydrogen atoms and a hydrogen molecule. Each atom consists of a 
proton and one electron. The proton weighs approximately 2000 times 
as much as the electron and is a relatively immobile particle about which 
the electron moves in its orbit or quantum mechanical wave function. 
(We shall discuss the interpretation of wave functions further in Chapters 
5 and 14 ) In an isolated atom, this wave function has spherical sym- 
metry, and the electronic charge is distributed on the average as a dif- 
fuse sphere centered about the proton. When the two atoms are brought 
close together, interaction between the wave functions of the two electrons 
takes place, and the electronic clouds become modified as suggested in the 
bottom part of Figure 1.1. The result is to produce an extra accumulation 
of charge between the two protons which acts to bind them together. 
According to quantum mechanical laws associated with the "Pauli exclusion 
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principle", the bond is especially stable when it contains precisely two 
electrons. It is weakened considerably by removal of one electron and is 
not greatly strengthened by the addition of a third electron. This special 
stability of the electron-pair bond or covalent bond is a fundamental fact 
of chemistry which is now quite well understood on the basis of wave 
mechanics. 1 
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Fig. 1-1— Electron-Pair Bond in the Hydrogen Molecule. 

In the periodic table of the elements, as shown in an appendix, covalent 
bonding is especially important in the elements of Class III which, according 
to Hume-Rothery's classification, form valence crystals. 1 * The elements 
carbon, silicon and germanium which come in group IV of the table have 
atomic numbers, usually denoted as Z, of 6, 14 and 32. These numbers 
are equal to the charge on the atomic nucleus measured in units of the 
electronic charge. A neutral atom thus has a nuclear charge of +Ze 
surrounded by Z electrons each of charge — e. 

The elements carbon, silicon, and germanium have the common feature 

1 See L. Pauling, The Nature of the Chemical Bond, Cornell University Press, 1939. 
la W. Hume-Rothery, "The Structure of Metals and Alloy," Institute of Metals, Lon- 
don (1936) revised 1950. 
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Fig. 1-2— The Electronic Structures of Carbon, Silicon, and Germanium Atoms. 




Fig. 1-3 — The Diamond Structure, Showing How Each Atom Forms Four Bonds 

with Its Nearest Neighbors. (- 

(The lattice constant, denoted by a, is the cube edge in this figure. For diamond, 
silicon, germanium, and gray tin, its value is respectively 3.56,(5.42j 5.62, and 6.46A 
where IA = 1 angstrom = 10"" 8 cm. The distance between nearest neighbors is 
V3/16 a. The figure shows 18 atoms, but only 8 really belong to the volume a 3 ; 
since the 8 corner atoms are each shared by 8 cubes, they contribute only 1 ; the 6 face 
atoms are each shared by 2 cubes and contribute 3; and there are 4 atoms wholly 
inside the cube. The number of atoms per cm 3 is thus 8/a 3 = 17.7,- 5.00^4.52 and 
2.82 X 10 22 respectively.) 
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indicated in Figure 1.2 of being tetravalent. Although they possess re- 
spectively 6, 14, and 32 electrons all together, in each case only four of these 
are able to enter into chemical reactions. The remaining electrons are 
closely bound to the nucleus producing a stable "ionic core" having a net 
charge of +4 units. This core can be regarded as completely inactive so 
far as electronic processes in chemical reactions and in semiconductors are 
concerned. 

Each of these atoms tends to form covalent or electron pair bonds with 
four other atoms. This tendency is completely satisfied in the diamond 
structure which is the crystalline form of all three elements. This struc- 
ture, shown in Figure 1.3, is a cubic arrangement and may be regarded as 
made up of eight interpenetrating, simple cubic lattices like the one formed 
by the atoms on the eight corners of the large cube shown. Although in- 
teresting features in the theory of conduction in the diamond lattice arise 
from the detailed nature of its crystal structure, from the point of view of 
this chapter we are interested only in the feature represented in the upper 
left-hand corner of Figure 1.3. This part of the figure shows that each 
typical atom is surrounded by four neighbors regularly placed about it, 
with which it forms four covalent bonds. These neighbors are arranged 
on the corners of a regular tetrahedron in conformity with the known 
chemical behavior of the tetrahedral carbon atom. 2 For purposes of dis- 
cussion of the conductivity in these crystals, we shall represent the three- 
dimensional array in two dimensions as is shown in Figure 1.4, indicating 
that each carbon atom forms an electron-pair bond with four neighbors. 
The crystal is, of course, electrically neutral as may be seen by considering 
one ionic core and its share of the charge in the four electron-pair bonds 
which surround it. Each such unit is electrically neutral as shown in (b). 
When impurity atoms are present, as we shall discuss later, units like (b) 
are not always electrically neutral. 

On the basis of this valence-bond structure we can intuitively see why 
diamond should be an insulator. Although it contains a large number of 
electrons, as does a metal, the covalent bondjs an entirely different struc- 
ture from the metallic bond. The metallic bond is frequently described 
by saying that the electrons behave substantially like a gas of free electrons 
producing a uniform cloud of negative charge in which the positive ions of 
the metal float. In the case of diamond, however, the electrons do not 
behave like a gas permeating the crystal but instead more closely resemble 
structural elements which hold the crystal together. In an ideally perfect 
crystal, each valence bond would contain its two electrons; therefore, 

2 Long before the arrangement of atoms in the diamond crystal was established by X rays, 
the organic chemists had concluded that carbon formed four bonds at the tetrahedral angles — 
a truly remarkable inference from observations of the optical properties of solutions of organic 
compounds. 
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every electron would be tightly bound and thus unable to enter into the 
conduction process. The situation may be crudely represented by the 
analog of an automobile storage garage in which one floor is completely 
filled with automobiles. In this case, no flow of traffic would be possible 

Conductivity can be produced in diamond, however, in a number of 
ways, all of which involve destroying the perfection of the valence bond 
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structure. Thus if high-energy particles or quanta of radiation fall upon 
the crystal, they can break the bonds. Conductivity in diamond induced 
by bombardment in this way has recently received considerable prominence 
in connection with "crystal counters" which have been used to detect 
nuclear particles and in experiments on electron-bombardment induced 

3 The theory of energy bands, discussed in Chapter 5, especially Section 5.6, shows how 
there may be a continuous transition from valence-bond structures to metallic structures On 
this basis the metallic properties of tin and lead do not contradict the theory presented here. 
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conductivity. 4 In Figure 1.5(a) we represent a photon delivering its 
energy to an electron which is ejected from one of the bonds. This ejected 
electron constitutes a localized negative charge in the crystal as shown m 
(c) since before it arrived in that part of the crystal the electron-pair bond 
structure was electrically neutral. Such an electron, which represents an 
excess over and above that required to complete the bond structure in its 
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p IG _ i-S— Excess Electrons and Holes in a Diamond Crystal. 

neighborhood, is called an "excess electron". Since it cannot enter any of 
the completed bonds in (c), it migrates about in a random manner in the 
crystal under the influence of thermal agitation. If an electric field is 
applied, it tends to drift in the direction of the applied force and to carry a 
current. Its behavior is represented by the vehicle on the second floor 

* D E Wooldridge, A. J. Ahearn and J. A. Burton, Phys. Rev. 71, 913 (1947). P. J. Van 
Heerden, Thesis (Utrecht, 1945). K. G. McKay, Phys. Rev. 77, 816-825 (1950) references^ 

6 F S Goucher, Bulletin of the 298th Meeting of the American Physical Society, has reported 
evidence that every photon in the wave-length range of 1.0 to 1.8 M absorbed in germanium 
produces a hole-electron pair. 
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of (b), which is now free to move. Like the vehicle, the electron also has 
been lifted to a state of higher energy by being removed from the bond. 
As mentioned earlier, this process of conduction by excess electrons is re- 
ferred to simply as conduction by "electrons". This procedure serves to 
distinguish it from the other process whereby electrons also conduct, called 
conduction by "holes", discussed below. 

The vacant space in (b) now permits traffic to flow on the first floor, in 
the analogy. A similar process takes place in the crystal through the 
motion of the hole left in the bond when the ejected electron moved away. 
As is shown in (d), this hole constitutes a net, localized, positive charge in 
the crystal, since before it was introduced that part of the crystal was elec- 
trically neutral. Its motion takes place, as shown in (d), by a reciprocal 
motion of electrons in the valence bonds (just as the vacant parking place 
in (b) can move owing to the successive motions of vehicles into it). 
Under the influence of an electric field, the random motion of the hole 
acquires a systematic drift, and it can also contribute to the current. 6 

Current flow in an illuminated diamond crystal is represented in Figure 
1.6. The electrons and holes, produced in pairs by the photons, drift in 
opposite directions in the field; the electron, being negative, drifts in the 
opposite direction from the applied field, but the electric current it produces 
is, of course, in the direction of the field. In the case of the hole, the re- 
ciprocal electron motions are also opposite to the direction of the field 
(on the average). As a consequence, the net result is that the hole drifts 
to the right, and since its charge is positive it produces a current to the 
right. (The result obtained could be represented by raising the right 
side of the garage of Figure 1.5 (b). The vehicle on the top floor would run 
to the left and the vacant place on the first floor would shift to the right.) 
If the source of illumination is removed, the photoconductivity dies away 
and the crystal then returns to its normal state. This can occur by the 
recombination of holes and electrons as shown in Figure 1.6(b). If the 
electron drops into the hole, both the hole and electron disappear and the 
bond structure becomes complete, the excess energy being given up to 
the atoms in the form of thermal vibrations as suggested in Figure 1.6(b). 7 

On the basis of quantum-mechanical theory, as discussed in Chapter 7, 

6 In this chapter we present an oversimplified picture of the conduction process. Neither 
the electron nor hole may be considered so localized as shown in Figure 1.5; they are even 
less localized than in Figure 1.14. The replacement process for the hole is not so simple either. 
So far as the idea of two kinds of conduction is concerned, the picture presented here is correct. 
Chapters S to 8 are required to give the same picture in more precise terms. The basic theory 
of semiconductors, which gives analytic form to the ideas discussed here, is due to A. H. 
Wilson, Proc. Roy. Soc. 133A, 458 (1931). 

7 The process of recombination may actually be much more complicated and may involve 
intermediate stages in which the hole or the electron is trapped. For further discussion see 
K. G. McKay, Phys. Rev. 74, 1606-1621 (1948); and Section 3.1d and Chapter 12. 
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one may think of the hole as moving through the crystal as a positively 
charged particle with much the same attributes as a free electron except 
for the sign of its charge. 

It is evident that an important distinction exists between behavior of 
electrons which have been excited out of the valence bond structure and 
those which remain in it. The concepts and terminology of the theory of 
Brillouin zones and energy bands have been developed to describe these 
distinctions. According to this theory, which is discussed in Chapters 5 
to 7, electrons in the valence bonds occupy a set of energy levels covering a 
certain band of allowed energies. All of these energy levels are occupied 
in the ideal crystal, and the band of energy levels is referred to as the filled 
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band, valence-bond band, or simply valence band. It corresponds to the 
first floor of the garage. It is impossible for an electron to have a higher 
energy than the highest state in the filled band unless it is given a suffi- 
ciently large increment of energy so that it may become a free or excess 
electron of the sort discussed previously. The electrons in these free states 
may also have various energies of motion and thus give rise to the energy 
level scheme of the empty band or conduction band (the second floor). The 
filled band is also a band which permits conduction but is not referred to as 
a "conduction band". The filled and empty bands are separated by a 
region of forbidden energies for which there are no energy levels for the 
electrons in the crystal. In this introduction the results of the energy 
band theory will be used without reference to their theoretical basis. 

If the temperature is sufficiently elevated, spontaneous breaking of the 
covalent bonds by thermal agitation will occur, producing electrons and 
holes in equal numbers. This effect would occur at such high temperatures 
in diamond that it has not been observed. However, it plays a major 
role in silicon and germanium at temperatures well within the range of 
investigation in the laboratory. 

1.2 IMPURITY SEMICONDUCTORS; DONORS AND 
ACCEPTORS 

If the only cases of conductivity open to investigation were like those 
already discussed, for which electrons and holes are present in equal num- 
bers, the problem of interpreting the data would be very difficult. For- 
tunately, in the semiconductors silicon and germanium, there are cases in 
which conductivity is due to excess electrons only or to holes only. We 
shall discuss some specific examples for silicon 1 and indicate later how these 
are related to germanium. 

If the conductivity of the sample is due to excess electrons it is called 
n-type, since the current carriers act like negative charges; if due to holes, it 
is called p-type, since the carriers act like positive charges. Figure 1.7 shows 
an example of «-type silicon. The conductivity arises from the presence 
of arsenic atoms which are termed "impurities", even though added de- 
liberately in the otherwise pure silicon. The arsenic atom, as indicated, 
has five valence electrons surrounding a core having a charge of +5 units. 
On the basis of evidence which we shall discuss, it is believed that each 
arsenic atom displaces one of the silicon atoms from its regular site and 
forms four covalent bonds with the neighboring silicon atoms as shown in 
Figure 1.7, thus using four of its five valence electrons. The extra electron 

1 A systematic analysis of the behavior of silicon with impurities of the sorts discussed here 
was first carried out by J. H. Scaff, H. C. Theuerer, and E. E. Schumacher, /. of Metals 185, 
383-388 (1949). Their work was stimulated by the development of silicon detectors for 
microwave use by R. S. Ohl, also of Bell Telephone Laboratories, in the pre-war years. 
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cannot fit into these four bonds and is free to move about the crystal. As 
discussed previously, this excess electron constitutes a mobile, localized 
negative charge. The arsenic atom, on the other hand, is an immobile, 
localized positive charge, since its core, with a charge of +5 units, is not 
neutralized by its share ( — 4) of the charge in the valence bonds. Its net 
charge, therefore, just balances that of the excess electron it contributes 
to the crystal. Thus arsenic impurity atoms add excess electrons but do 
not disturb the over-all electrical neutrality of the crystal. The negative 
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Fig. 1-7 — Excess or Electron Conduction in Silicon Containing Arsenic. 



electrons are attracted to the positive arsenic atoms and at low tempera- 
tures become bound to them. However, at room temperature, thermal 
agitation shakes them off and the electrons are free as shown in Figure 1.7. 
(Measurements of conductivity over wide temperature ranges yield valu- 
able information, as we shall discuss later.) 

A p-type semiconductor is shown in Figure 1.8. In this case the added 
impurity, boron, has a valence of three and therefore cannot complete the 
valence bond structure surrounding it. The hole in one of the bonds to the 
boron atom can be filled by an electron from an adjacent bond, and the 
hole can thus migrate away, as described in Figure 1.5(d). The boron 
thus becomes an immobile, localized negative charge. Because of the 
symmetry between the behavior of holes and electrons, we can describe 
the situation shown in Figure 1.8 by saying that the negative boron atom 
attracts the positively charged hole but that thermal agitation shakes the 
latter off at room temperature so that it is free to wander about and con- 
tribute to the conductivity. 
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Impurities with a valence of five are called "donor impurities" because 
they donate an excess electron to the crystal; those with a valence of three 
are called "acceptor impurities", since they accept an electron from some- 
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Fig. 1-8 — Defect or Hole Conduction in Silicon Containing Boron. 

where else in the crystal to complete the structure of the valence bonds 
with their neighbors, thus leaving a hole to conduct. These terms, to- 
gether with the other features already discussed, are summarized in Table 
1.1. 

Table 1.1 Summary of Features of Impurity Conduction in 
Silicon and Germanium 



Conductivity Type 


w-Type or Excess 


p-Type or Defect 


Conduction by 


(excess) electrons 


holes 


Energy band in which carrier 
moves 


empty or conduction 


full or valence-bond 
or valence 


Sign of carrier 


negative 


positive 


Valence of impurity atom 


5 


3 


Name for impurity atom 


donor 


acceptor 


Typical impurities 


Elements of Group IV: 
Phosphorus, P 
Arsenic, As 
Antimony, Sb 


Elements of Group III: 
Boron, B 
Aluminum, Al 
Gallium, Ga 
Indium, In 
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In Figure 1.9 we illustrate how the number of carriers depends on tem- 
perature for a sample of silicon with five donors and two acceptors. At 
absolute zero, the three extra electrons furnished by these seven atoms 
are all bound to the donors. As the temperature is raised to — 125°C, or 
148°K (or absolute temperature on the Kelvin scale), thermal agitation is 
sufficient to shake off the electrons. However, they recombine so rapidly 
that on the average only one electron out of three is free to conduct. At 
room temperature, two out of three can conduct; however, thermal agita- 



0- 


@ © 


© 




© 


"® 


Wr 





© 
- (± 


) 




w 




© 


© 


(§- 





© 



-® 



© 
_ © 
© 



© + - + 

_ ®_ + 
©+_© 

-© i 



ABSOLUTE ZERO 



ROOM TEMPERATURE 



(+) DONORS (n-TYPE) ELECTRONS 

Q ACCEPTORS (P-TYPE) -|- HOLES 

Fig. 1-9 — Dependence upon Temperature of Hole and Electron Densities 
in w-Type Silicon. 

tion is insufficient to break the valence bonds, and the number of holes is 
about 10 -12 times the number of electrons.. 2 (The situation at room 
temperature in germanium is quite different, and the hole concentration may 
be about 1 percent of the electron concentration in high resistivity samples. 3 ) 
At higher temperatures an appreciable fraction of the valence bonds is 
broken thermally, and holes and electrons are created in pairs. In Figure 
1.9 at 630°C, the number of carriers produced thermally outweighs those 
due to impurities by 6 to 1, and the conductivity is essentially the same as 
if there were no impurities present at all. Under these conditions the 
sample is said to be in the intrinsic range, since the conductivity exhibited 
is an intrinsic property of silicon itself and independent of the impurity 
content. 

The data shown in Figure 1.9 have been chosen to fit, approximately, an 
actual silicon sample (see Figure 1.12) studied by Pearson and Bardeen. 

2 This value is estimated by extrapolating Figure 1.12 discussed later with the aid of Figure 
16.9 in Chapter 16. 

3 The situation for germanium at room temperature is discussed in Section 10.4 and shown 
in Figure 10.7. 
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1.3 INTERPRETATION OF DATA ON CONDUCTIVITY 
AND HALL EFFECT 

Pearson and Bardeen 1 have investigated a number of silicon samples 
prepared by J. H. Scaff and his colleagues containing varying amounts of 
added phosphorus and boron. We shall quote a number of results from 
their paper, to which the reader is referred for additional technical details. 
Before discussing the nature of these results, however, we should indicate 
the general outlines of the experimental procedure used for obtaining the 
basic date on the numbers and behaviors of the electrons and holes. 

The conductivity of a semiconductor containing holes and electrons 
depends upon the number present and the ease with which they are moved 
by an applied electric field. (These subjects are discussed at length in 
Chapters 8, 10, and 11.) This latter property is called the "mobility" and 
is expressed as the drift velocity of the particle in centimeters per second in 
an electric field of one volt per centimeter. It has, therefore, the dimensions 
of cm 2 /volt sec. The conductivity (in ohm -1 cm -1 ) due to excess electrons 
is equal to the total charge per unit volume (in coulombs/cm 3 ) of the 
electrons which are free to move times the mobility. The total charge 
is, of course, equal to the density of electrons (in cm -3 ) times e the electronic 
charge (1.60 X 10 - 19 coulombs). Combining this with the conductivity 
due to holes leads to the equation for conductivity 

a = e(nn„ + pn P ) (1) 

where n andp (for negative carriers and positive carriers) are the densities 
of holes and electrons, respectively, and \x n and \i. p their respective mobili- 
ties. (This formula is derived in connection with Figure 8.6.) 

A measurement of conductivity alone, therefore, gives only one item 
of information about the four unknown quantities occurring in this 
equation. Fortunately, another relationship between these same four 
quantities can be obtained by measuring the Hall effect. 

The Hall effect, described in Chapter 8 in more detail, occurs when a 
magnetic field is applied at right angles to the direction of current flow in 
the specimen. It is found that this produces a transverse voltage, which 
can be measured. When the Hall effect has been measured, it gives 
another equation relating the four unknown quantities. However, if 
an w-type sample is dealt with in the impurity conduction temperature 
range, p may be taken as equal to zero. Under these conditions measure- 
ments of conductivity and Hall effect permit a separate determination of 
n and n n . Similarly, p and p. v can be determined in p-type samples. Thus, 
by combining information on various samples it is possible to find the 

1 G. L. Pearson and J. Bardeen, Phys. Rev. 75, 865-883 (1949). 
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behavior of n n and m p as a function of temperature. With n n and n P 
regarded as known, it is then possible to obtain the values of n and p even 
in the case where both holes and electrons are simultaneously present. 
The mobilities for several cases of interest are quoted in Table 1.2. The 
method of measurement is discussed in Section 3.1 and Chapters 8, 11, 
and 12. 

Table 1.2 Mobilities in Cm/Sec per Volt/Cm 

Electrons fi n Holes p. v 

Carbon (diamond) >400° >200" 

900" — 

156 c — 

Silicon 300 d 100* 

Germanium 

(Hall mobility)..' 2600" 1700" 

(Drift mobility) 3600' 1700' 

° K. G. McKay, Phys. Rev. 77, 816-825 (1950). 
6 C. C. Klick and R. J. Maurer, Phys. Rev. 76, 179 (1949). 
<X Seitz, Phys. Rev. 73, 549-564 (1948). 

" G. L. Pearson and J. Bardeen, Phys. Rev. 75, 865-883 (1949). 
e G. L. Pearson, Phys. Rev. 76, 179 (1949). 

'}. R. Haynes, unpublished results presented to the American Physical Society at Chicago, 
November 26, 1949. (See Section 3.1 and Chapter 12 for further details.) 



We shall next consider the data analyzed by Pearson and Bardeen for 
a series of samples containing various amounts of added phosphorus. 
Phosphorus, like arsenic, has a valence of five and produces w-type material 
(it was not used in the example of Figure 1.7 in order to reduce the con- 
fusion between the symbol P for phosphorus and ^-type, which is not 
produced by phosphorus). Figure 1.10 shows the resistivities for four 
samples with varying phosphorous content plotted as a function of temper- 



ature. 2 



Data from the Hall effect are plotted in Figure 1.11. The results have 
been expressed in terms of the density of excess electrons per cubic centi- 
meter. Since the density of silicon atoms is 5.00 X 10 22 per cubic centi- 
meter, it is seen that sample D corresponds to one electron per 250 atoms 
in the crystal. For sample A at room temperature, on the other hand, 
there is only one conduction electron for approximately 500,000 atoms in 

2 The phosphorous content on the curves refers to that added to the molten silicon. Some 
of this phosphorus probably evaporates before being incorporated in the solidified silicon; 
furthermore, the phosphorus tends to remain in the molten silicon and be segregated in the 
last portions to solidify. The net result is to reduce the phosphorous content in the samples 
measured for Hall effect below the proportion added to the melt. Experimental evidence 
that one excess electron is contributed by each donor in the solid is furnished by data obtained 
with radioactive antimony in germanium discussed at the end of this chapter. 
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Fig. 1-10 — Resistivity of Silicon-Phosphorous Alloys Versus Temperature. 
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the lattice. Figure 1.12 shows a more detailed analysis of sample A. 
This diagram shows both experimental points (the circles and dots) and 
theoretical curves (the solid lines) obtained on the basis of considerations 
of the energy level diagram for the semiconductor and the Fermi-Dirac 
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Fig. 1-12 — Comparison between Predicted and Observed Electron and Hole Densities 

Versus Temperature. 

statistics, subjects discussed in later chapters. It it seen that the theo- 
retical interpretation of the data leaves little to be desired. The physical 
picture behind the theory is the one already described in connection with 
Figure 1.9. The concentrations of acceptors and donors and for sample 
A were 1.5 X 10 16 and 12 X 10 16 cm -3 (a ratio of 1 to 8 whereas in Figure 
1.9 we have shown a ratio of 2 to 5). 

The mobility of the electrons in the various samples is shown in Figure 
1.13. It is to be noted that at high temperatures there is an "intrinsic 
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mobility", at least for samples A and B (C and D we will discuss later), 
which arises from the interference to the electrons' motion due to thermal 
vibrations of the atoms of the crystal. For small impurity concentrations 
this term is independent of the impurities. For lower temperatures, the 
mobility is reduced by the impurities; in this case the process which limits 
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Fig. 1-13 — Mobility of Electrons in Silicon-Phosphorous Alloys Versus Temperature. 

the speed of drift in an applied electric field is the deflection of the drifting 
electrons by the electrostatic field of the charged impurity atoms. These 
subjects are discussed in Chapter 11. 

The reader experienced with plots like Figures 1.11 and 1.12 will be 
aware that from them activation energies can be obtained; just as they 
can be obtained from the Richardson plot for thermionic emission or the 
plot of the temperature dependence of a chemical reaction constant. In 
all of these cases one encounters terms of the form 



y = e^ /kT 



(2) 
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where y is the quantity studied, 8 is the "activation energy" involved, and 
kTis thermal energy. If In v is plotted against 1/T, a straight line 

lny = -(S/*)X (1/T) (3) 

is obtained whose slope is (&/k) from which 8 can be found since k (Boltz- 
mann's constant) is known. 

There are three energies of particular importance that can be obtained 
from studies of the slopes of the In n and In p versus \/T plots. (However, 
the analysis involves much more than simply finding the slope of the plots.) 
These energies are (1) the energy required to break a valence bond and 
create a hole and electron pair and (2) the energy required to remove an 
electron from a donor or (3) a hole from an acceptor. The energy required 
to break a bond can be obtained almost at once from the slope of the line 
of Figure 1.12 in the intrinsic range. Theory shows that this slope should 
give &i/2k where Si is the energy required to break the bond. The 
analysis for 8 2 in »-type samples or 83 in ^>-type samples is much more 
involved. At very low temperatures the slopes are expected to be S2/& 
and not S 2 /2£ (see Chapter 16, Figures 16.7 and 16.8 for details). How- 
ever, values for these energies may be obtained by making a proper analysis 
of the data. The values obtained by Pearson and Bardeen for silicon are: 

Energy to break bond 1.11 electron volts 

Energy to remove electron from donor 0.054 electron volt 
Energy to remove hole from acceptor 0.08 electron volt 

(An equally thorough analysis for germanium has not been published; we 
have chosen representative values of 0.72, 0.04, and 0.04 for purposes of 
illustration. For diamond, the value to break the bond is 6 or 7 electron 
volts from ultraviolet absorption data; the energies estimated as described 
below to separate electrons from donors or holes from acceptors are prob- 
ably about 0.35 electron volt. 3 ) 

The small energies of binding of electrons to the donors and of holes to 
the acceptors can be explained on the basis of a model proposed by H. A. 
Bethe. The electron is supposed to move in the field of the donor in 
much the same way as an electron moves about the proton in a hydrogen 
atom. 4 There is one important difference: Because the net positive 
charge of the phosphorous ion is embedded in a dielectric medium, the 
force of attraction is reduced by the dielectric constant — about 13 for 
silicon. As a result of this the electron in the bound state about the 

3 Ultraviolet data from A. J. Ahearn, Phys. Rev. 73, 1113 (1948). Ahearn also finds that 
10 ev of a-particle energy are required for each bombardment-produced electron, and K. G. 
McKay, Phys. Rev. 77, 816-825 (1950), finds 10 ev for 14-kv electrons. The dielectric con- 
stant of diamond is estimated as about 6.2 from its index of refraction. McKay finds trap 
energies of about 0.3 electron volt for electrons. 

4 This type of approximation was first published by G. Wannier, Phys. Rev. 52, 191-197 
(1937). 
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phosphorous atom moves in a very large orbit whose radius is 13 times 
that of an electron in a hydrogen atom, giving a binding energy of about 
0.08 election volt (discussed in Chapter 9). The wave function overlaps 
a large number of silicon atoms as is indicated in Figure 1.14 which is 



PHOSPHORUS ATOM 
HAS EXCESS + CHARGEs 



EXCESS ELECTRON 
,FROM PHOSPHORUS ATOM 




APPROXIMATE SCALE OF WAVE FUNCTION 
INDICATED BY DOTS 



Fig. 1-14 — Wave Function of Electron Bound to Phosphorous Atom in Silicon. 

approximately to scale. The fact that the wave function extends over so 
many atoms suggests that its detailed behavior around the impurity 
atom itself is unimportant and that the binding energy produced by the 
impurity is due almost entirely to its charge. This explains the observed 
fact that all donor and acceptor impurities produce about the same binding 
energy for electrons and holes. The difference in binding energy between 
electrons and holes is explained by Pearson and Bardeen on the basis of a 
difference in effective mass, a concept discussed in Chapter 7. Further- 
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more, they find the activation energies quoted above only for their 
purest specimens ; for others, the activation energy is less, a trend which 
they explain by considering the effects of other impurity atoms on an 
electron trying to escape from one impurity atom. 

When relatively large concentrations of impurities are present, new 
effects occur; these are illustrated in Figure 1.11 by samples C and D in 
which the number of electrons is substantially independent of temperature. 
The explanation of this behavior is that the impurities have moved so 
close together that appreciable overlapping of the impurity wave functions, 
such as those shown in Figure 1.14, occurs. Under these conditions the 
extra electrons and the impurity atoms play much the same role as the 
electrons and ions in a metal, and the excess electrons form a degenerate 
electron gas which moves through the irregularly distributed array of 
positive ions. The theory of electrical conductivity for these semi- 
conductors is very similar to that of metallic alloys, as is discussed in some 
detail in Chapter 11. The conductivity of the semiconductors is much 
smaller than that of the metallic alloys, however, because the density of 
carriers is much smaller; aside from this difference, the behavior of C 
and D are substantially like metallic alloys for low temperatures. As 
the intrinsic range is approached, however, the number of carriers and 
the conductivity increase rapidly with increasing temperature. 

The results described for silicon for the case of electrons in connection 
with Figures 1.9 to 1.14 are obtained in substantially the same form for 
holes in silicon 5 and for electrons and holes in germanium. An important 
and characteristic feature of germanium is that the energies required to 
remove holes from acceptors and electrons from donors are so small that 
a negligible fraction of the excess electrons or holes are bound to the impurities 
at room temperature. In other words, the donors and acceptors are effec- 
tively fully ionized. This fact simplifies considerably the theory of trans- 
istor electronics for germanium. 

Germanium samples show the interesting property that heat treatment 
at temperatures above 500°C appears to produce p-type impurities. 
Prolonged heat treatment at lower temperatures causes these impurities 
to disappear. One theory is that these effects are not due to chemical 
impurities at all but instead to disorder in the germanium lattice. The 
total number of disordered atoms predicted from theory is in approximate 
agreement with the concentrations observed. 6 

5 G. L. Pearson and J. Bardeen, Phys. Rev. 75, 865 (1949). 

6 The fraction of atoms disordered at temperature T should be approximately exp (-&/kT) 
where 8 is the energy required to disorder an atom. From data on self-diffusion in metals 
U. Steigman, W. Shockley, and F. C. Nix, Phys. Rev. 56, 13-21 (1939)], we may estimate that 
8 is between !0kT m and 20kT m , where T m is the melting point. This leads to values of 
10 -7 to 1(T 10 for the exponential. The observed values correspond to about 10 acceptor 
centers or a fraction of about 10 -8 of the atoms. 
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Evidence that disorder in the lattice produces p-type conductivity in 
germanium has been obtained by K. Lark-Horovitz and his colleagues. 7 
When high resistivity «-type germanium is bombarded with alpha particles 
or deuterons, it is converted to ^>-type, whose conductivity increases with 
increasing exposure to bombardment. It has been established that this 
effect is not due to transmutation of the germanium to another element 
but instead to the creation of disorder in the crystal, that is, germanium 
atoms knocked into interstitial positions leaving vacant lattice points 
behind. These "lattice defects" act as acceptors. There appears to be 
no reason why the wave functions around these defects should be like 
those shown in Figure 1.14, and, consequently, one might expect such 
samples to show different activation energies at low temperatures. The 
data on this subject are reviewed in Chapter 9. 

Before closing this introduction, some additional experimental evidence, 
which supports the general picture that conductivity is produced by sub- 
stitutional impurities of the sort described, should be presented. There 
are three types of evidence which indicate that the impurity atoms enter 
the lattice substitutionally in lattice sites normally occupied by atoms of 
the pure element rather than interstitially by squeezing into vacant spaces 
in the lattice: 

I. The conductivity produced by the impurities is of just the sort which 
would be expected on the basis of their ability to complete or not to com- 
plete the valence bond structure which would surround them if they 
occupied normal sites. (See Table 1.1.) On the other hand, if they were 
to enter the lattice interstitially, they would not fit into the valence-bond 
scheme at all, and there would be no reason for supposing that consistent 
p- and w-type behavior would occur for elements with valences of three 
and five. 

II. The activation energies required to remove electrons from the 
impurities, as quoted from the work of Pearson and Bardeen, show good 
agreement with the value to be expected theoretically on the basis of the 
wave functions of Figure 1.14. 

III. There is evidence obtained by X-ray studies of the lattice constant 
for the more concentrated alloys of silicon with phosphorus and boron. 
The lattice constant of these alloys has been studied as a function of 
concentration and is found to decrease in essentially the way that would 
be expected if the phosphorus and boron atoms entered the lattice sub- 
stitutionally, these atoms being in both cases smaller than the silicon 
atom. However, if they were to enter the lattice interstitially, it 
would be expected that they would tend to expand the lattice and pro- 

7 K. Lark-Horovitz, E. Bleuler, R. Davis, and D. Tendam, Phys. Rev. 73, 1256A (1948). 
R. E. Davis, V. A. Johnson, K. Lark-Horovitz, and S. Siegel, Phys. Rev. 74, 12S5A (1948). 
V. A. Johnson and K. Lark-Horovitz, Phys. Rev. 76, 442-443 (1949). 
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duce an increase in lattice constant in contradiction with the experi- 
mental facts. 

In the case of germanium it has been verified that even in some relatively 
pure samples, in which the impurity content is much too small to determine 
by conventional chemical analysis, the conductivity may be controlled by 
minute amounts of added impurities. This investigation has been carried 
out with the aid of radioactive antimony which has been added to melts 
of germanium. It has proved possible to measure the antimony content 
by radioactive means and to establish that there was good agreement 
between the antimony content and the number of electrons produced. 8 

Problems 

1. An ingot of germanium is formed by melting together 100 gm of 
germanium and 3.22 X 10 -6 gm of antimony. Assuming that the 
antimony is uniformly distributed, show that the density of antimony 
atoms is 8.70 X 10 14 cm -3 . [The density of germanium is 5.46 gm/cm 3 
and its atomic weight is 72.60; the atomic weight of antimony is 121.76. 
One gm atom, that is 72.6 gm of germanium, contains Avogadro's number 
( = 6.02 X 10 23 ) atoms. There is a slight discrepancy between the density 
of germanium atoms deduced from X-ray measurements, Figure 1.1, and 
from density and atomic weight.] 

2. Assuming that the excess electrons due to the antimony atoms of 
problem 1 are fully excited at room temperature and that their mobility is 
3600 cm 2 /volt sec, show that the conductivity of the ingot of problem 1 is 
0.50 ohm -1 cm -1 or 50 ohm -1 meter -1 in M.K.S. units. Hence show that 
a specimen 1 mm square in cross-section and 2 cm long would have a resist- 
ance of 400 ohms. 

3. Assume that the ingot contains 0.78 X 10 -6 gm of gallium, atomic 
weight 69.72, in place of antimony. Show that the density of gallium 
atoms will be 3.68 X 10 -14 cm -3 . Taking n P as 1700 cm 2 /volt sec show that 
the conductivity will be 0.10 ohm -1 cm -1 . 

4. Assume that both the gallium and antimony discussed above are added 
to the same 100 gm ingot. Why will the conductivity be 0.29 ohm -1 cm -1 
and not 0.6 ohm -1 cm -1 ? Will the material be »-type or p-type ? 

5. Assume that a flash of light falls uniformly on the specimen of problem 
2 and that a total number, 1.74 X 10 13 , of photons are absorbed and that 
each produces one hole-electron pair. Show that the resistance will drop 
to 162 ohms. 

6. Show that if 10 volts are applied from end to end of the specimen of 
problem 2, an electron drifts with a speed of 18,000 cm/sec and requires 
1.11 X 10 -4 sec to traverse the length of the specimen. 

» G. L. Pearson, J. D. Struthers, and H. C. Theuerer, Phys. Rev. 75, 344 (1949), 77, 809-813 
(1950). 



CHAPTER 2 

THE TRANSISTOR AS A CIRCUIT ELEMENT 

2.1 IDEAS ABOUT AMPLIFICATION PRIOR TO THE 
TRANSISTOR 

2.1a. Introduction. In Chapter 1, we have presented a description of 
the behavior of holes and electrons appropriate to circumstances in which 
their distributions in space and time are substantially uniform. In order 
to deal with transistor action, however, and with related phenomena in 
semiconductors, we must describe what happens when the concentrations 
of both holes and electrons vary with position and time. Studies of such 
phenomena have been greatly facilitated from the experimental point of 
view by a series of new techniques and experimental procedures developed 
in connection with transistor research. The experiments permit the 
isolation and individual analysis of a number of the separate processes 
which go on in connection with transistor action. For this reason we shall 
discuss them in Chapter 3 before presenting in Chapter 4 the theory for the 
conventional or type-A transistor 1 invented by J. Bardeen and W. H. 
Brattain. 

In this chapter we shall review briefly the history of semiconductor 
development and research, especially in connection with amplifying 
devices, and show how this was culminated by the invention of the trans- 
istor. We shall then give a description of the transistor itself from a 
phenomenological point of view and describe its equivalent circuit. The 
exposition in later sections is directed in considerable measure towards 
giving a physical meaning to the purely formal equivalent circuit. 

The general level of presentation aimed at in Part I is qualitative so 
far as the theory of holes and electrons is concerned, and an attempt has 
been made to present the necessary semiconductor theory in descriptive 
terms covering the thermal generation of holes and electrons, their recom- 

1 The first authoritative treatment of the type-A transistor is that of J. Bardeen and 
W. H. Brattain, "Physical Principles Involved in Transistor Action", Phys. Rev. 75, 
1208-122S (1949). Other publications are as follows: J. Bardeen and W. H. Brattain, "The 
Transistor: A Semi-Conductor Triode", "Nature of the Forward Current in Germanium 
Point-Contact Rectifiers", Phys. Rev. 74, 230, 231 (1948); R. M. Ryder, "The Type-A 
Transistor", Bell Laboratories Record 27, 89-93 (March, 1949); J. N. Shive, "The Double 
Surface Transistor", Phys. Rev. 75, 689-690 (1949); J. A. Becker and J. N. Shive, "The 
Transistor; A New Semi-Conductor Amplifier", and R. L. Wallace and W. E. Kock, "The 
Coaxial Transistor", both in Electrical Engineering 68, 222-223 (1949); R. M. Ryder and 
R. J. Kircher,"Some Circuit Aspects of the Transistor", Bell Syst. Tech. J. 28, 367-400 (1949) . 
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bination, and their motions in electric and magnetic fields. The energy 
band theory described in Parts II and HI has been developed especially 
for such situations; however, it is not necessary to invoke the general 
theory in order to get a good mental picture of the important processes 
in transistor action. In some cases we have given quantitative results 
and formulae; however, in order to make this material available at an 
early date in the transistor art, it has not been feasible to prepare a general 
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Fig. 2-1— Rectifiers and V-I Plots. 

exposition of the quantitative theory of transistor action to the extent that 
has been attempted for the underlying theory of holes and electrons. 

The basic idea that semiconductors may be made to amplify electrical 
signals is relatively old and, as we shall discuss in the next paragraph, 
amplification by negative resistance thermistors had actually been achieved 
prior to the invention of the transistor. The similarity of the current- 
voltage curves for vacuum-tube diodes and for crystal rectifiers, Figure 2.1 , 
suggested to many people that it might be possible to make an amplifier 
from a crystal diode by somehow adding a grid. The desired aim has now 
been accomplished by the transistor, but, as we shall discuss, the amplifying 
action does not employ a grid. Another striking difference is that, in a 
vacuum-tube diode, it is the forward current that is controlled by the 
insertion of the grid. In the transistor, on the contrary, the output contact 
is biased in the low-current, high-resistance direction, and the current is 
enhanced and controlled by the input contact. 
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Amplifying action by use of the negative-resistance characteristics of 
thermistors had been achieved and analyzed relatively early in the 
thermistor development work. The negative resistance of a thermistor is 
made possible by the negative temperature coefficient of resistance of semi- 
conductors such as, for example, that shown for the intrinsic range in 
Figure 1.10. As the current through a thermistor is increased, the voltage 
at first rises in accordance with Ohm's law. For large currents, however, 
appreciable heating of the thermistor results, thereby causing a decrease 
in resistance, and the current-voltage characteristic becomes nonlinear. 
A sufficiently high current increases the temperature so much that the 
resulting drop in resistance may actually produce a voltage decrease and, 
consequently, a negative differential resistance occurs. This negative- 
resistance characteristic can be used to make oscillators or networks with 
gain rather than attenuation, provided the frequencies are low enough 
so that the temperature of the thermistor can follow the variations of 
current. This requirement sets an upper frequency limit which depends 
on the thermal time constants under the operating conditions. By making 
the physical dimensions small and the thermal conductivities high, stable 
oscillations have been produced at frequencies as high as 100,000 cycles 
per second. 2 However, there is an additional mechanism involved which 
will limit the frequency no matter how small the structure is made: Time 
is required for the numbers of electrons and holes to come to equilibrium 
after the temperature is changed. Not much is known about this time 
for most semiconductors. We shall discuss later in Section 3. Id some 
data bearing on this time for germanium at room temperature. 

2.1b. Modulation of Conductivity by Surface Charges. In the Solid 
State Research Group at Bell Telephone Laboratories, the hope of dis- 
covery of a purely electronic, rather than thermal, semiconductor amplifier 
was bolstered by the discovery of what might be called an existence theorem, 
to use a mathematical term, that such an amplifier was possible, or at least 
not contradictory to the theories of semiconductors and statistical me- 
chanics. The particular device envisaged is shown in Figure 2.2. It 
consists of a very thin layer of semiconductor placed on an insulating 
support. This layer of semiconductor constitutes one plate of a parallel- 
plate capacitor, the other being a metal plate in close proximity to it. If 
this capacitor is charged, as shown in part (c), with the metal plate 
positive, then the additional charge on the semiconductor will be repre- 
sented by an increased number of electrons. At room temperature, in 
germanium, a negligible number of these electrons will be bound to the 
donors (see Section 10.4 for details). Consequently, the added electrons 
should be free to move and should contribute to the conductivity of the 
semiconductor. In this way, the conductivity in the semiconductor can 

2 Personal communication from J. A. Becker. 
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be modulated, electronically, by a voltage put on the capacitor plate. 
Since this input signal requires no power if the dielectric is perfect, power 
gain will result. 

The effects to be expected from such an arrangement are very appreciable, 
a conclusion which we shall illustrate for its own sake, and also for the 
purpose of showing how concentrations of electrons and holes may be 
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Fig. 2-2 — Modulation of Conductance by Surface Charges. 
(The extra electrons induced on the surface in (c) carry an added current.) 

used in calculations. Suppose the layer consists of w-type germanium 
with a resistivity of 2.4 ohm cm. 3 The concentration of electrons is 
obtained from the formula for conductivity in ohm -1 cm -1 (or coulombs/ 
volt sec cm) 
1 



= neii, = n cm -3 X 1.6 X 10 -19 coulombs X 2600 cm 2 /volt sec 
= n X 4.2 X 10~ 16 coulombs/volt sec cm 



(1) 

3 In M.K.S. units, this is a layer 10~' m = T thick with conductivity a = 10 2 /2A = 
42 ohm -1 m _l with electrons having mobility of 0.26 m 2 /volt sec. The number of electrons 
per unit volume is given by a = ney. so that n = 42/0.26 X 1.6 X 10~ 19 m -3 giving 1.0X 10 14 
per square meter of surface. A field of 30,000 volts/cm or 3 X 10 6 volts/m produces a surface 
charge for a medium of dielectric constant 2 of 2 X eo X 3 X 10 6 = 5.3 X 10~ 5 coulombs 
m -2 or 3.3 X 10 u electrons m -2 . 
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giving 



n = 1/2.4 X 4.2 X 10 -16 = 1.0 X 10 15 electrons per cm 3 . 



(2) 



In a layer T = 1000^ = 10~ 5 cm thick there will thus be nT = 1.0 X 10 10 
electrons per cm 2 . We shall next calculate how many electrons will be 
induced by a field in the dielectric of 30,000 volts/cm or 100 electrostatic 
volts/cm. This field will produce a displacement of kE/At electrostatic 
coulombs. Taking k = 2, for a typical dielectric between the capacitor 
plates, we calculate a charge of 2 X 100/4tt =16 stat. coulombs per cm 2 , 
corresponding to 16/4.8 X 10~ 10 = 3.3 X 10 10 electrons per cm 2 , or 
three times the number normally present. Such a field should thus 
quadruple the conductivity of the germanium layer. 

Experiments have been carried out with layers of various semiconduc- 
tors, and effects of the sort discussed have been observed. However, the 
degree of modulation has been somewhat less than that calculated above. 
For layers of germanium about 5000A thick, in which the mobility was 
only about 40 cm 2 /volt sec, it was apparent upon analysis of the data that 
only about 10 per cent of the induced charges (holes in this instance since 
the evaporated germanium was p-type) were effective in changing* the 
conductance. 4 This reduced effectiveness can be accounted for on the 
basis of a theory dealing with the behavior of the current carriers at a semi- 
conductor surface. 

2.1c. Bardeen's Theory of Surface States. J. Bardeen has proposed 
that the ineffective portion of the induced charge is lodged in states localized 
at the surface, employing for this purpose his theory of surface states, which 
he has so fruitfully applied to the explanation of a number of otherwise dis- 
connected facts about semiconductor surfaces. A thorough discussion of 
his surface state theory would require the mathematics of energy bands and 
Fermi-Dirac statistics to an extent not in keeping with the level of exposi- 
tion of Part I. We shall effect a compromise by quoting conclusions based 
on his theory as it applies to the modulation of conductivity by surface 
charges without elaborating upon the underlying reasoning. We shall 
adopt the same procedure later in discussing the nature of transistor action 
with metal point contacts. 

The conclusion reached by Bardeen is that electrons which move in the 
body of the semiconductor can become tightly bound in surface states on 
the surface of the semiconductor and thus become immobilized. The 
Surface States Diagram, Figure 2.3, represents the free surface of an w-type 
semiconductor and shows, in part (a), eight electrons trapped in the sur- 
face states. These electrons repel other electrons in the conduction band 
and thus produce a layer of depleted conductivity below the surface. In 
this layer there are eight unneutralized donors, so that the semiconductor 

4 W. Shockley and G. L. Pearson, Phys. Rev. 74, 232-233 (1948). 
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surface as a whole is neutral. (In the diagram the donor ions are arranged 
in a regular scheme so as to facilitate counting them; in a sample of ger- 
manium, such as we have just discussed, the donors will be distributed at 
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random and will be on the average about 500 times as far apart as normal 
atoms in the lattice.) In part (b) the net charge density is plotted as a 
function of distance below the surface. This charge density gives a 
variation of electrostatic potential from point to point in accordance with 
the energy curve for an electron shown at (c). The effect of the repulsion 
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of the electrons in the surface states and the attraction of the unneutralized 
donors are represented by the approximately parabolic curve of energy 
versus distance. The surface states bind the electrons tightly in energies 
even lower than they would have in the conduction band, deep inside the 
crystal. Semiconductors, like metals, have work functions; the energy 
required to remove an electron from inside the semiconductor to a point in 
free space in front of it is represented by the height of the energy curve 
above zero on the left edge of part (c). 

It may be helpful at this point to make a few remarks on the relationship 
between energy of an electron shown in Figure 2.3 and the electrostatic 
potential. One obvious difference is that, because of the negative charge of 
the electron, the energy diagram and the electrostatic potential diagrams 
are inverted in respect to each other and electrons tend to fall uphill in 
an electrostatic potential diagram. A more important difference, especially 
when an electron moves near a polarizable surface such as that of a metal, 
is that the electron induces charges by its presence and thus changes the 
electrostatic potential. 5 The energy diagram includes these interactions 
so that the energies shown represent the way in which the energy of the 
entire system depends on the position of the electron. Not all of the energy 
difference between states arise from differences in electrostatic interactions, 
however. The change in energy between an electron in the conduction 
band and one in a deep lying surface state is in part electrostatic and in 
part kinetic. In the hydrogen atom of Figure 1.1, for example, the electron 
has a negative potential energy which has twice the magnitude of the 
kinetic energy. In general, differences in energy between different elec- 
tronic states involve mixtures of kinetic and potential energies and it is 
usually not practical or useful to try to sort them out. 

When the metal capacitor plate is charged positively, a compensating 
negative charge flows through the charging circuit and appears as six extra 
electrons on the surface of the semiconductor, in the Surface States Dia- 
gram, at (d). Four of these electrons go into the surface states, and two 
remain as excess electrons in the conduction band. The resulting modifi- 
cations of the space charge distribution (e) and energy for an electron (f) 
follow directly from part (d). The electric field between the capacitor 
plates is represented in (f) by the slope of the energy curve outside the 
semiconductor. The example shown in the Surface States Diagram is, of 
course, qualitative, and it is reasonable to suppose that 90 per cent of the 
charge goes into the surface states in accordance with the results measured 
by Pearson. 6 

5 See, for example, W. G. Dow, Fundamentals of Electronic Engineering, John Wiley & 
Sons, New York, 1937, Section 94. 

6 W. Shockley and G. L. Pearson, Phys. Rev. 74, 232-233 (1948). 
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A number of experiments have been performed which give information 
on various features of the surface state theory just presented. 7 

2.2 THE TYPE-A TRANSISTOR 
2.2a. Its Discovery and External Aspect. Experimental and theoretical 
work on the properties of surface states was given emphasis in the research 

COLLECTOR 




Fig. 2-4— Cut-away View of the Type-A Transistor. 

Two phosphor bronze wires .5 mils in diameter are pointed, bent and welded to the 
nickel mounting wires. The latter are previously molded into a plastic plug, which 
makes a light press fit in the mounting cylinder. The germanium wafer 20 mils thick 
and 50 mils square is soldered into a brass mounting plug which is also pressed into 
the cylinder After an electrical forming treatment the unit is vacuum impregnated 
with wax through the hole in the cylinder. (See Section 4-5 for some informatton 
on forming.) 

program at Bell Telephone Laboratories. Stimulated by Bardeen's theory, 
the group undertook a number of experiments to measure characteristics 
of the surface states. In many instances the effects sought were below the 
threshold of sensitivity of the methods employed. While exploring certain 
aspects of this problem, John Bardeen and Walter Brattain encountered 
' For a summary of several experiments, see the original paper by J. Bardeen; Ffys. .Rev. 
71 717-727 (1947). For information on contact potential see W. H. Brattain and W. Miock- 
ley Phys Rev. 72, 345 (1947). For photoelectric emission see L. Apker, E. Taft, and 
J Dickey Phys. Rev. 74, 1462-1474 (1948), which contains many references. 
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some new effects, and, branching off into a new area of theory and experi- 
ment, they invented the transistor. 1 

We shall next discuss the transistor from a phenomenological viewpoint 
and will present, thereafter, the theory of its operation. 

Figure 2.4 shows a cut-away view of a transistor. The schematic 
diagram of Figure 2.5 shows the bias battery supplies which activate the 
transistor so that it can amplify an input signal. The two point contacts of 
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Fig. 2-5 — Schematic Diagram of Transistor. 

The area of interaction which surrounds the emitter point, the conventional symbols 
and the sign conventions for current flow and voltage are shown. (Typical values 
for a type-A transistor are I t = 0.6 ma, V t = 0.7 volt, I c = -2.0 ma, V c = -40 
volts.) 

the transistor are, individually, rectifying points of the general type dis- 
cussed with Figure 2.1. Under operating conditions, the input or emitter 
point is operated in the plus (forward or low-resistance) direction and the 
output or collector is operated in the minus (reverse or high-resistance) 
direction. An important discovery of Bardeen and Brattain was that, 
when the input point is biased for forward current flow, it becomes sur- 
rounded by an area of interaction ; if the output point is placed within this 
area, the input current controls the output current in such a way that power 
gain results. The region of interaction is not sharply defined, and the power 

1 For additional details of this history together with a discussion of many theoretical and 
experimental aspects of transistors, see J. Bardeen and W. H. Brattain, "Physical Principles 
Involved in Transistor Action," Phys, Rev. 75, 1208-1225 (1949). 
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gain in one conventional circuit arrangement varies from about 100-fold 
(or 20 db) at 0.005-cm spacing to unity or (0 db or unit power transmission) 

at about 0.025 cm. , • , • i_ • 

The theory of transistor action, which follows later in this chapter, is 
that the emitter point emits holes into the »-type material, which other- 
wise would contain only electrons. (A few holes are normally present, see 
Chapter 10, but they make a negligible contribution to the conductivity.) 
These holes are drawn to the negative collector point and add to its current. 
In Section 3.1 we shall describe experiments which are consistent with this 
picture and give detailed information about the numbers and behaviors of 
the holes. In this section we shall describe the transistor characteristics 
and show how they lead to power gain, postponing a further description of 
the physical theory of the type-A transistor itself until the simpler cases 

have been studied. . 

2 2b. The Static Characteristics. The transistor, being a three-terminal 
device' requires the specification of two voltages and two currents to 
describe its operating condition. The conventions selected for this purpose 
are those standard for three-terminal networks. As shown in Figure 2.5, 
the voltages are considered as positive when positive in respect to the base, 
and the currents are considered as positive when they flow from metal 
point to semiconductor (that is, when electrons flow from semiconductor 
to metal) This situation is more complicated than for a conventional 
vacuum-tube triode because for the latter the grid current is usually taken 
as zero so that it suffices for its static characteristics to specify the plate 
current as a function of plate and grid voltages. In the transistor, values 
for both voltages and both currents are significant. There is another 
important difference: For a given set of voltages, there may be two or three 
possible sets of currents in the transistor. That is, the currents are multi- 
valued functions of the voltages. (The reverse is true in a vacuum tube 
suffering from grid emission, and there may be two grid voltages for a given 
grid current.) However, for the transistor, there is only one set of voltages 
for a given pair of currents in the normal operating range. It is thus 
appropriate in the case of the transistor to express the voltages as functions 

of the currents. 

The symbols chosen for emitter, base and collector are e, b and c. Ihe 
choice of e rather than e reduces the likelihood of confusion with c when 
subscripts are used, as the reader may verify by considering V V e and V*. 

Figure 2.6 shows a plot for a typical transistor, for d-c conditions. 1 he 
two upper plots show V t as a function of the currents, two conventions of 
presentation being employed, and the lower plots similarly show V. 
These particular plots have been chosen for exhibiting transistor char- 
acteristics because of the convenient relationship of their slopes to paranv 
eters in the small signal theory of transistor operation. 
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One extra line is shown in the upper left plot, which corresponds to a 
fixed collector voltage of -40 volts. This line has been extended beyond 
the range of the operating characteristics. It shows that, at fixed collector 
bias, the emitter voltage first increases with emitter current and then, 
somewhat beyond the operating range, it decreases. In other words, there 
are two sets of values of emitter current and collector current which cor- 
respond toF c = -40 volts and V t = -0.1 volt. In a presentation of the 
data in the form of currents as functions of the voltages, this multivalued 
feature would cause considerable inconvenience; so far as the operation of 
the transistor is concerned, however, the situation is even more serious. If 
the transistor were operated at these two voltages, that is, from low- 
impedance external sources, it would be conditionally stable at the first 
point and unstable at the second. If a transient impulse should happen to 
take it out of the stable range, it would tend to run away to currents still 
higher than the upper point and a burned-out unit would result. To avoid 
such damage, transistor circuits are designed so as to eliminate the insta- 
bility in amplifiers and limit the currents in flip-flop circuits and thus to 
overcome or to control the instability. In Section 4.1 (b) we shall show that 
the negative resistance in the input point is associated with a current 
amplification between input and output circuits and that the multivalued 
aspect of the plots is related to the decrease in the resistance of the rectifiy- 
ing emitter contacts as the emitter current increases. 

The static characteristic lines are of importance in determining the non- 
linear behavior of transistors and in designing switching and counting 
circuits using them. Some problems illustrating the principles involved 
are given at the end of this chapter. 

2.2c. The Equivalent Circuit. 2 If we write the functional relationship of 
Figure 2.6 in the form 

^=/i(/*,/c) a) 

Vc=MI t ,Ic), ( lb ) 

then the small a-c voltages v e and v c produced by small a-c currents i f and 
i c are 3 

o« =T7-i« + Tr* e = riii, + ri 2 i« (2a) 

alt dl c 

vc-^i, + d ^ic = r 21 i t + r 22 i c (2b) 

al ( dl c 

2 General reviews of the theory of networks will be found for passive circuits in E. A, 
Guillemin, Communication Networks: Vol. II, John Wiley & Sons, New York, 1935 and foi 
active circuits in L. C. Peterson, Bell Syst. Tech. J. 27, 593-622 (1948). 

3 In this section we shall assume that the frequency is so low that all circuit parameters arc 

resistive. 
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where the r's, which are abbreviations for the partial derivatives, are 
simply the slopes of the curves of Figure 2.6. A system described by the 
equations just given can be represented as a passive network of resistances 
if ri2 = r 2 \ and ru > r i2 < r 2 i\ otherwise negative resistances or voltage 
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Fig. 2-7 — Equivalent Circuits for the Transistor and Relationships 
Among the Parameters. 



(Generators inside dashed rectangles are equivalent if r„. 



a.r c .) 



or current generators must be introduced. Figure 2.7 shows three equiva- 
lent circuits together with the relationships between the various parameters 
involved. 

The quantity a 

a = _r^i = 5i ( 3) 

is a measure of the effect which the hole current from the emitter has upon 
the collector. We shall discuss its physical meaning in later sections. The 
quantity r 12 , which represents a reaction at the emitter produced by the 
collector, is called the base resistance; it represents a positive feedback in 
the grounded base amplifier circuit and, as such, it increases the gain and 
decreases the stability of the circuit. 

In Figure 2.7(b), the resistance r m in the voltage generator v m i ( is the 
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active mutual resistance. As we shall see at the end of this section, the 
power gain is approximately proportional to r m . 

From the static characteristics shown in Figure 2.6, values of the quanti- 
ties r n , etc., can be obtained for any operating point. The following table 
gives typical values together with other information on operating con- 
ditions. 

Table 2.1. Preliminary Data for Type-A Transistor 

. . Average Equivalent Circuit 

Typical Operating Conditions Parameters (ohms) 



Emitter Current 0.6 ma m = 530 Emitter Resistance r, = 240 

Emitter Voltage 0.7 volt r ]2 = 290 Base Resistance r b = 290 

Collector Current.... -2 ma r 21 = 34,000 Collector Resistance r c = 19,000 
Collector Voltage.... -40 volts r 22 = 19,000 Mutual Resistance r m = 34,000 

: a = ri\/rii= 1.8 



Maximum Ratings: Not to be exceeded in continuous operation. Voltages relation 
to base. 

Collector Voltage -70 volts 

Collector Dissipation 0-2 watt 

Grounded Ease Operation: Class A, working from a 500-ohm generator into a 20,000- 
ohm load. 

Operating Power Gain ~ 17 db 

Power Output ~ 5 mw 

From this data it is evident that the input impedance is of the order of 
hundreds of ohms. From the values of a it is seen that the output current, 
in equivalent circuit (c) for example, is even larger than the input current 
and is available from an impedance of the order of 20,000 ohms. From this 
it is at once evident that there is the possibility of power gain. 

The source of the power gain resides in the current or voltage generators 
of the equivalent circuits of parts (b) and (c) of Figure 2.7. (Power gain 
may occur even if a e is less than unity.) In later sections we shall discuss 
the internal mechanism of the transistor and relate the physical picture to 
the equivalent circuit so that the mechanism of power gain will be given a 
physical interpretation. In this section, however, we shall deal with the 
equivalent circuit per se making no use of the underlying physics peculiar 
to transistors. The treatment given, however, will reflect an interest in 
the later interpretation, and partly for this reason no use will be made of 
the specialized techniques of circuit theory. Although much of the discus- 
sion of this section can be compactly expressed in terms of results of the 
theory of feedback amplifiers, little use of these ideas and terminology is 
introduced. On the one hand, the reader familiar with these ideas will 
have little difficulty in recasting the discussion in terms of general circuit 
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theory and, on the other hand, it would be inappropriate to incorporate a 
sufficient discussion of circuit theory to make it of benefit to the reader 
primarily concerned with the fundamental mechanisms of semiconductors. 
To the latter the circuit characteristics of the transistor are of interest 
chiefly as manifestations of more basic processes. 

As an example of the use of the equivalent circuit and as a further 
exhibition of the behavior of the type-A transistor, we shall carry out 
calculations for the transistor as a grounded base amplifier. This is, of 
course, only one of a number of the possible circuits in which the transistor 
may be used. The grounded base amplifier is shown in Figure 2.8. The 

-ic 





(b) 



Fig. 2-8— The Transistor as a Grounded Base Amplifier. 



(a) Working from a signal generator, v g , r g into a load r L . 

(b) Equivalent output circuit. 

emitter is connected to an input generator v g , r g represented by a voltage 
generator v g in series with a resistance r g . The output load resistance 
is r L The bias power supplies are not shown. The generator has a 
certain power capacity which is defined as its available power. This power 
is defined as the maximum power which the generator can deliver and, as 
is well known from circuit theory, this power is a maximum when the 
generator looks into a matched load. For this case it has the value 

Vg 2 l& g = available power from input generator. (4) 

All power gains which we shall discuss will be defined in terms of the ratio 
of the output power to the power available from the input generator. 

The power gain in the grounded base circuit is obtained by a straight- 
forward analysis of the circuit of Figure 2.8. In order to describe die 
output from the transistor in terms of an equivalent generator v g , r g as 
shown in part (b), we shall carry out the analysis by writing down expres- 
sions for the voltages at terminals e and c rather than by applying Kirchhofi s 
law to the loops. We thus obtain for terminal e: 

v t = v g - r„i f (from generator) (5a) 

»e = r n i ( + rizie (f rom transistor). (5b) 



9 r r n + r g 
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Similarly we obtain for terminal c : 

v c = —TLic (from load) (6a) 

v c = r 2 \i e + r 22 i c (from transistor). (6b) 

We shall next solve the last equation by expressing i t in terms of v„. 
Having done this, we shall find that the equation for v c becomes very similar 
to the equation for the generator working into e so that we can interpret 
the transistor as converting the generator v g , r g into a new generator v g , 
r g . Solving the v ( equations for i t gives 

i ( = (v 9 - r 12 i c )/(r n + r g ). (7) 

Inserting this in the last equation gives 

/ ri2f2i \ . . r 2X 

Vc = I r 22 : ) U + 

\ r n + r g l 

= v g ' + r g 'i c , (8) 

the latter form corresponding to replacing the transistor by v g , r g , the 
change in sign in the r„'i e term as compared to r g i t in equation (5a) being 
due to the current convention (i f is current out of v g , r g and i c is current 
into r g ', ij). Hence we conclude that the equations 

Vg = r 2 iv g /(r n + r g ) (9a) 

r g ' = r 22 - ri 2 r 2 i/(r n + r g ) (9b) 

describe the transistor as viewed from the load. 

The power delivered by the generator into the load is the output current 
% c = Vg'/(r g ' + r£) operating into n,: 

r L ic 2 = r L { Vg '/(r g ' + r L )} 2 . (10) 

The ratio of this power to the available power of v g , r g is the operating gain: 

~ • n , s ?W 2 4r ff 
Operating gain = G(r ff , r L ) = , r^ — 2 

vg "r rL) V g 



Ar L r g 



fel— [tJ c. 



(r/ + r L f 
where 

A = (rn + r g )(r 22 + r L ) - ri 2 r 2 i (12) 

is the circuit determinant. As the notation G(r B , ri.) implies, the operating 
gain depends on the external input and output circuits as well as on the 
constants of the transistor. The problem of maximizing G(r g , a,) by 
proper choices of r g and tl will be discussed below. The circuit determi- 
nant is an index of the stability of the circuit. If A < 0, the circuit will 
spontaneously build up an oscillation even though v g = 0. If A > 0, the 
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circuit may be stable; however, the value of A given in (12) is the low- 
frequency circuit determinant, and its behavior at high frequency is 
involved in determining the stability of the circuit; we shall return to this 
point briefly below. 

Let us next consider the constant of the output generator in terms of the 
values of Table 2.1 The ratio of the voltage generators is, approximately, 

„„' r 21 34,000 ^ 3Q ^ (13) 



v g r u + r g 530 + r g 

the approximation corresponding to r g = 500 ohms, a representative value 
for many applications. The ratio v g '/v g would be the voltage gain mto 
an infinite impedance load and would be analogous to the y. of a tube. 1 he 
value of r g is influenced by the feedback due to r l2 and is 

T «r 12 1 |\ 1-8 X 2901 

= r 22 [l - 0.45] = 10,500 ohms, (14) 

the last two terms corresponding to r g = 500 ohms. It is evident that 
the effect of positive feedback through the term involving ar 12 acts to 
reduce the output generator impedance and thus to increase the power 
available from the generator v g ' , r/. In principle the generator impedance 
r ' can be reduced to zero or made negative by adding an external feed- 
back resistance to the base. This reduction is equivalent to increasing 
all of the r's in the equivalent circuit by the added resistance, the con- 
tributions being important chiefly for r u and r 12 . As soon as r g passes 
through zero and becomes negative, the impedance between terminals c 
and b, being r g 'r L /(r g ' + r L ), will also become negative, and oscillations 
associated with stray capacitances and inductances may occur. Ihis 
is an example of instability arising before the low-frequency A vanishes; 
A does not vanish until r g ' + r L = 0. The problem of finding the opamum 
value of the feedback resistance is similar to other problems of feedback 

amplifier design. 4 

Using the value 10,500 for rj found in (14) and the circuit constants for 
Grounded Base Operation of Table 2.1 we find from (11) that 

Operating gain = 47 or 16.7 db. (14a) 

Two other definitions of gain which are frequently referred to are the 
available gain and the maximum available gain. The available gain is 
found by adjusting r L so as to maximize G(r g ,r L ); this adjustment ob- 

*H. W. Bode, Network Analysis and Feedback Amplifier Design, D. Van Nostrand Co-, 
Inc., New York, 1945. 
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viously corresponds to setting vl = r g and gives 

G{r g , r g ) = -7 ^ — = available gain. (IS) 

r, LTn + rj 

For the values of circuit parameters previously used, its value is 
500 '34,0001 2 



10,500 L 1030 J 



52 or 17.2 db. (16) 



For the 20,000-ohm load resistor of the Table (which gives greater stability 
than the value 10,500 which maximizes the operating gain) the value of the 
operating gain is less by a factor equal to (11) -H (16): 

4rif g ' 4 X 20,000 X 10,500 



(r/ + r L f (30,500) 2 



= 0.9 (17) 



a loss of only 0.5 db. 

The available gain is still a function of the external circuit through r g . 
The maximum available gain is obtained by maximizing this available gain. 
When this is done, it is found that r g , as well as Vl, is matched to the 
transistor, and r g and rjr, have values 

r g = [(rnr 22 - r 12 r 2 i)rn/r 2 2) y2 (18a) 

tl = [(rur 2 2 - ri 2 r 2 i)r 22 /r n ] H . (18b) 

If the positive feedback is too great, the foregoing expressions become imag- 
inary. The critical condition is 

rnr 22 - r 12 r 2 i = r 22 (r n - ari 2 ) = 0. (19) 

In terms of the matched generator and load the maximum available gain is 
given in terms of r g and tl of (18a) and (18b) as follows: 

r 2 x 

maximum available gain = — • (20) 

(Tu +r g )(r 2 2 + r L ) 

Inserting the values from Table 1.3, we obtain 

r g = 76 ohms, ri, = 2750 ohms (21 ) 

and a maximum available gain of 88 or 19 db. The extra 2 db of gain as 
compared to the situation of r g = 500 ohms, ri, = 20,000 has been obtained 
by reducing the external resistances to such low values that the margin 
of stability has been reduced to a dangerous level. It is better, therefore, 
to sacrifice some gain for greater stability. 

For purposes of discussion in later sections we shall consider the maxi- 
mum gain neglecting the feedback term r J2 . Under these conditions, the 
values for r g and r^ for the maximum available gain reduce to rn and r 22 
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respectively, and the formula for maximum available gain becomes 

•i ui • r 2i 2 " 2r 22 , 9r , 

maximum available gain = = — V."J 

since r 2 i = «r 2 2- This formula indicates that large current amplification, 
represented by large a, and a large ratio of collector to emitter impedance 
both favor gain. 

It should be pointed out in closing these remarks on the equivalent circuit 
that we have discussed only the "grounded base" application of the trans- 
istor. This mode of operation is in a general way analogous to the grounded 
grid operation of a vacuum tube; in fact for a = 1, the circuit theories 
become very similar, at least at low frequencies. Two other arrangements 
are the grounded emitter, corresponding to grounded cathode, and the 
grounded collector, corresponding to the grounded plate or cathode-fol- 
lower circuit. The equivalent circuit of the transistor is, of course, equally 
applicable to all of these cases. For further circuit details the reader is 
referred to the literature. 5 

2.2d. Noise. The type-A transistor exhibits noise having the frequency 
spectrum associated with contact or current noise. The noise power per 
cycle varies inversely with frequency so that the noise per octave is the 
same for all octaves, at least over the frequency range for which the equiv- 
alent circuit parameters are independent of frequency. 

A detailed discussion of noise theory is beyond the scope of the present 
treatment. 6 We shall, however, discuss how the equivalent circuit may be 
modified to include the effect of noise. This can be done by adding two 
noise voltage generators of voltage v nt and v nc into the emitter and collector 
branches of the equivalent circuit as shown in Figure 2.9(a). 7 Typical 
values for the noise generators are as follows: the root mean square noise 
voltages in a 1 cycte per second band at 1000 cycles per second are 

v n( = 1 microvolt (23a) 

Vnc = 100 microvolts. (23b) 

Since the noise power varies as 1//, the values of v ne and v nc at another 
frequency,/, are (1000//) H times their values at 1000 cycles. 

In the grounded base amplifier circuit of Figure 2.8, the effect of v ne will 
be the same as o B ; and v nc will simply combine additively with v g ' . Hence 
the equivalent circuit looking into the transistor from the load will be as 

6 R. M. Ryder and R. J. Kircher, Bell Syst. Tech. J. 28, 367-400 (1949). 

6 For a discussion of the theory of noise in crystal rectifiers, see H. C. Torrey and C. A. 
Whitmer, Crystal Rectifiers, McGraw-Hill Book Co., New York, 1948, Chapter 6. 

7 L. C. Peterson, "Space-Charge and Transit-Time Effects on Signal and Noise in Micro- 
wave Tewodes", Proc. IRE, November 1947, 1264-1274. 
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shown in Figure 2.9(b) with v ng given by the equation 



Vng = 



+ V n 



r\\ + r g 
= 33p„ e + Vnc (24) 

where we have used the values in Table 2.1 for the circuit constants and 
500 ohms for r g . There is evidence that the noise generators v ne and v nc 
are "correlated" to a certain degree, indicating that for both of them the 
noise arises in part from a common source. For our example the contri- 
butions of v nc and v ni to v ng are respectively 100 and 33 microvolts. If 

(v) — *- (V) _ * (V) — - 




(v) (v) 





(a) 
Fig. 2-9 — Noise in Transistors. 

(a) Noise generators in the equivalent circuit. 

(b) Equivalent output circuit. 

they were perfectly correlated constructively, the contribution would be 
133 as compared to 67 for perfect negative correlation, a difference of 6 db 
in v ng . The correlation coefficient C of two sources is defined for our 
example by the equation 

v ng 2 = (33) 2 0„ e 2 + Vnc 2 + 2 X 33C Vv n < 2 v nc 2 . (25) 

Correlation coefficients ranging from —0.8 to +0.4 have been observed 
for type-A transistors. 

If correlation effects are neglected, the mean square noise voltage in 
frequency range from/t to/ 2 will be 8 

vng 2 (/ij 2 ) = (llOO0„ e 2 + tf„ c 2 )1000 ln./a//i. (26) 

Over the range of 1000 to 10,000 cycles this formula gives 

»n„ 2 (10) = (0.25 + 2.3) X 10~ 5 volts 2 = 2.5 X lO -5 volts 2 (27) 

8 The contribution of v nc to v ng 2 can be obtained by summing the contributions from each 
frequency band df, thus obtaining 

v HC 2 (1000/ /W = t^lOQOln, (ft/ A). 



f 

Jh 
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corresponding to an r.m.s. value of 5 millivolts. The quantity (10) in 
» n „ 2 (10) indicates that a frequency band one decade wide is involved. It 
is a characteristic of the 1// power spectrum that the noise power depends 
only on the ratio/ 2 //i and not on the location of either frequency alone. 

It is instructive to compare this with the value of v g ' when the maximum 
of 5 mw of undistorted power is being delivered to the r^ of 20,000 ohms. 
This comparison leads to a determination of v g as follows: 

c v m-3 - v ° 2rL = ^ /22Q » Q00 
X (r L + r>? (30,500) 2 

giving 



v 



,' 2 = 230 volts 2 (28) 



corresponding to an r.m.s. voltage of IS volts or about jjr the bias voltage on 
the collector. The ratio of this (voltage) 2 to the (noise voltage) 2 is 
9.2 X 10 6 , or 70 db. This figure is the most significant figure of merit for 
repeater applications, and we shall refer to it as the gain range decade 
figure. It may also be defined as the "signal to noise ratio at maximum 
output for a frequency band one decade wide". For our example, output 
signals greater than IS r.m.s. volts will produce distortion, and output 
signals less than 5 r.m.s. millivolts will be no larger than noise. Usable 
output signals will, therefore, lie in a range about 70 db wide, the exact 
value being set by the tolerance limits on signal-to-noise ratio and dis- 
tortion. 

The value of 70 db corresponds to a band one decade wide. For a wider 
band, there will be more noise power, and the gain range figure for the 
wider band will be smaller. For example, the band from 100 to 10,000 
cycles per second is two decades wide and has 3 db more noise so that the 
gain range figure is 67 db. The gain range figure for any band may be 
obtained by computing 10 times the logarithm to the base 10 of the number 
of decades in the band and subtracting this from the gain range decade 
figure. As another example consider the band from 5 to 6 mc. This 
contains log J- = log 1.2 = 0.080 decades. The gain range for this band 
would thus be 70 - 10 log 0.080 = 70 - 10(-1.1) = 81 db. 

The gain range figure is obviously closely related to the permissible 
attenuation between repeaters. If the attenuation exceeds the gain range 
figure, then either the output of one repeater is overloaded or the first stage 
of the next repeater will have a signal output smaller than noise, it being 
assumed that the problems of impedance matching have been solved. In 
an actual design problem the considerations would include impedance 
matching, tolerances for distortion and signal-to-noise ratio, and the total 
number of repeaters in tandem. 

Another quantity frequently used to describe noise is the noise figure. 
This is defined in terms of an amplifying circuit, like that of Figure 2.8, for 
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example, as follows: If the only source of noise were the thermal or Johnson 
noise in the external generator resistance, which we represent by r g , v nT 
(T for thermal), then a certain apparent noise generator v nT ' would appear 
in the output. The ratio of (v ng ) 2 actually observed in the output to 
(v nT ') 2 is the noise figure. The value of the noise figure depends on the 
particular circuit constants used in the generator and has a minimum value 
for a certain choice. We shall not give the theory of making this choice 
but will consider the general expression for the noise figure and then com- 
pute it for the previously discussed case in which r„ = 500 ohms. The 
formula for the thermal noise voltage at room temperature {kT/e = 0.025 
volt) is as follows: 

v nT 2 = 4r g kTAf = r g (4kT/e)(eAf) 

= r 9 (0.1 volt)(1.6 X 10- 19 A/amp) 

= 1.6 X 10 _2 % (ohms) A/ (sec -1 ) volts 2 . (29) 

Combining this with the expression for voltage gain and with the formula 
for the output generator due to v nf and v nc leads to the expression for the 
total output noise generator: 

Vn, 2 (fl,f2) = Mr-T ^^ _/l) 

+ ll-^yj °-« 2 © °- 2 ] 100 ° ln * </*//i) ( 30) 

and to a noise figure 

F = ! + A uj t f , l^ 2 © V^^ Vnc 2 ] 1000 ln e (/ 2 //x). (31) 



where the © sign indicates that correlation is to be taken into account in 
combining the terms. Rather than substituting directly in the formula, 
we shall deal with the case of/i = 1000 and/ 2 = 10,000 cycles/sec dis- 
cussed earlier using r g = 500 ohms. For this band, v n r 2 is 1.6 X 10~ X 
500 X 9,000 volts 2 and contributes 33 2 times this amount to v ng 2 or 
8 X 10 -11 volts 2 . The value 2.5 X 10~ 6 volts 2 due to the transistor is 
larger than this by a factor of 3 X 10 5 or 55 db and (because of the large 
ratio) this is also the noise figure. 

A noise figure of 55 db for the 1000 to 10,000 cycle band is much worse 
than that of a good electron tube, which can come close to db. In view 
of the frequency dependence which brings the transistor noise figure down 
to 30 db at a megacycle, the comparison at video frequencies is less un- 
favorable, particularly if some developmental improvement can be made. 

The noise figure depends upon the circuit used as well as on all the circuit 
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parameters. For further information, the reader is referred to the liter- 
ature. 9 

The noise figure is evidently an index of the amount by which the 
transistor falls short of being a noiseless amplifier. It is evident that no 
appreciable advantage will be achieved by lowering the noise added by 
the transistor lower than say 3 db below that contributed by the generator. 
Since the noise generator in the equivalent circuit is much larger than 
thermal noise in the corresponding resistive components of the equivalent 
circuit (and also much larger than shot noise), there appears to be no basic 
physical principle which will prevent lowering the noise in transistors 
greatly. 

Problems 

1. Consider the circuit in which the emitter of the transistor is grounded 
and the input signal is applied to the base from an input generator r g , v g 
and the output is delivered to a load tl- Show that the characteristics of 
this amplifier are as follows: 

Circuit determinant: 

A = (r g 4- r b + r t )(r L + r f + r c - r m ) + r t (r m 
> for stability 

Input impedance = r- ia = r b -\- r ( -\ ■ ■ ■ 

rh + r ( + r c - r m 

r t {r m - r t ) 



r.) 



Output impedance = r oui = r c + r e — r m + 



r g + r b + r t 



Operating gain = G F = ir g r L [(r m — r 6 )/A]" 
Backward operating gain = Gr = 4r 8 rz,(r e /A) 2 

Show that for the transistor of Table 2.1, r g = 500 ohms and rx, = 20,000 
ohms that ri n = 2100 ohms, r out = —6900 ohms, Gp = 24 db, Gr = — 19 db. 
2. Consider the grounded collector amplifier as for problem 1 and show 
that the equations determining its behavior are as follows: 

Circuit determinant: 

A = (r g + r b + r c )(r L + r t + r c - r m ) + r c {r m - r c ) 
>0 for stability 

Input impedance = r ia = r b + r c -\ ■ ■ 

rL + r ( + r e - r m 

_ . . . , fcVm T e ) 

Output impedance = r ou t = r t + r c — r m -\ — — — 

fg~Vf" b ~r fe 

Operating gain (forward) = G F = 4:r g r L (r c /A) 2 

Backward operating gain = Gr = 4r fl r L [(r m — r c )/A] 2 = (1 — a) 2 G F 

9 See R. M. Ryder and R. J. Kircher, Bell Syst. Tech. J. 28, 367-400 (1949). Formulae 
for several cases are quoted. 
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Show that for the transistor of Table 2.1, r g = 20,000 ohms, and r L = 
10,000 ohms that r in = -41,000 ohms, r out = -7500 ohms, G F = 15 db, 
G K = tt db. 

Remark: The following problems are intended to give the reader some 
familiarity with the magnitudes of voltages and currents and with the non- 
linear behavior of the transistor of Figure 2.6. 

3. Construct a plot of emitter voltage versus emitter current for V c = 
-10, -20 and -30 volts with the base grounded. {Hint: For fixed 
values of V c read pairs of values of I t and 7 C from Figure 2.6 and plot in 
the ru subplot.) Verify that near I, = +1.0 ma the input resistance is 
abowt 80 ohms. 

4. Construct a plot, similar to problem 3, of V t versus I t but assuming 
that V c is biased from a -60 volt battery through 20,000 ohms. {Hint: 
Draw a load line on the r 22 subplot and proceed as for problem 3.) Verify 
that for I ( in the range 0.5 to 1.5 ma the input impedance is about 300 
ohms. (This problem shows that adding resistance in the collector cir- 
cuit increases the input impedance of the emitter.) 

5. This problem is intended to show how adding resistance between 
base and ground can lead to a reduction in input impedance from positive 
to negative values. For this purpose consider the V c = —20 volt curve 
of problem 3. Verify that as I f increases from 0.5 to 1.0 ma, 7 C changes 
from -1.8 to -2.7 ma so that I b = - (7« + 7 C ) changes from +1.3 to 
+ 1.7 ma. Next suppose that a resistance is inserted between the base 
and ground and let V b equal the voltage between base and ground and 
V t ' = V t + V b ' and V c ' = V c + V b be corresponding emitter and col- 
lector voltages above ground. Conclude that if the inserted resistance is 
500 ohms, the increase of I t from 0.5 to 1.0 ma will produce a negative 
change in V b ' given by 500 X (1.7 - 1.3) X 10~ 3 = 0.2 volts and hence 
that V ( ' will decrease so that the input impedance is negative and equal 
to about -300 ohms. (The small correction due to the effect of V b on V c 
is negligible in this example.) 

6. Consider the V t versus I ( plot for a circuit in which the collector is 
biased at -40 volts and the base is connected to ground through 10,000 
ohms. For 7 e = 0, show that I c = -1.0 ma and V b ' = -10 volts and 
V^ = V b + V t = -10 + (-0.3) = -10.3 volts. For negative emitter 
currents, large negative emitter voltages will result. The other operating 
points are found by the following graphical construction on the r 2 i plot: 
For each value of 7 e , the equation 

-40 volts = 10,000(7. + Ic) + Vc{IJc), 

must hold; this equation states that the battery voltage must equal the 
sum of the drop across the resistance plus the drop between base and 
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collector. The equation may be rewritten as 

V c {IJc) = [-40 volts - 10,0007 c ] - 10,0007 e . 

For a fixed value of I c , the left side is simply one of the parametric curves 
of the r 2 i plot. For the same value of I c , the right side is simply a straight 
line of slope — 10,000 ohms on the ri\ plot. Thus for each value of I c , 
the solution is given by the point of intersection of the straight line with 
the parametric curve. Verify that the following points will be obtained. 
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The small difference between V\> and V t ' is due to V t . Estimate that the 
negative resistance of V t versus I ( will continue with a slope of about 
— 6000 ohms until V\, approaches —35 volts, corresponding to the bunched 
curves near the top of the r 2 i plot extrapolated to about I t = 3.5 ma and 
I c = — 7 ma. For higher values of I f , I c will increase only slowly, V c will 
change little and a positive resistance will be produced at the input. 
This shows that the P\ , I t plot is an A^-shaped curve so that for a range of 
fixed emitter biases there will be two stable solutions and one unstable 
solution. This feature can be used to make flip flop circuits by biasing the 
emitter to a negative voltage through a suitable resistor. (This treatment 
is due to B. G. Farley.) 

7. (a) A grounded base transistor amplifier is coupled through a trans- 
former to a generator v g of internal resistance r g . Show that if emitter 
noise is neglected, the greatest signal to noise ratio in the load is obtained 
when the transformer impedance ratio is r g : r\\. 

(b) Suppose that the generator is a typical moving coil microphone for 
which r g — 11 ohms and that when the microphone is picking up speech 
from a distance of 3 feet, t = 5X 10~ 5 volt. Show that when proper 
matching is used the signal to noise ratio in the load is approximately 
0.6 db. Assume a band extending from 50 to 10,000 cps. 

(c) Show by a qualitative argument that the final signal to noise ratio 
is not much different when the amplifier contains several stages provided 
the gain per stage is of the order of 20 db. 

8. In view of the results of problem 7, how much gain can be built into 
a transistor amplifier designed for the frequency range 50-10,000 cps if it is 
required that the noise at the output must be 40 db below the maximum 
signal output ? Assume that max. signal output is 10 milliwatts. 

Ans. Approximately 46 db. 

9. Consider a transistor operated as a grounded base amplifier with a 
load of 10,000 ohms in series with a 40 volt battery. 
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(a) Using the curves of Figure 2.6, find the maximum peak-to-peak 
output current which is comparatively free from distortion. 

(b) Assuming that the input (emitter) current is sinusoidal, what is the 
d.c. emitter bias ? 

(c) If the a.c. component of the input is removed, what is the power 
dissipation in the collector ? What is the power drawn from the battery ? 

(d) Let the a.c. component of the input be restored but in such a way 
that, unlike (b), the output current is sinusoidal With, the maximum peak to 
peak amplitude. What is the a.c. power delivered to the load ? 

(e) Under the conditions in (d), what are the collector dissipation and 
the power drawn from the battery ? 

Ans. (a), 2.2 ma.; (b), .75 ma.; (c), 39.6 mw.; 88 mw.; (d), 6.1 raw.; 
(e), 33.9 mw., 80 mw. 

10. Show by procedures similar to those of problem 7 that the signal 
to noise ratio for optimum matching is not affected by connecting several 
similar transistors in parallel; in other words, the noise figure of a matched 
transistor amplifier is unchanged by paralleling similar transistors. 
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CHAPTER 3 
QUANTITATIVE STUDIES OF INJECTION 
OF HOLES AND ELECTRONS 
3.1 CARRIER INJECTION IN GERMANIUM* 
3 la Introduction. The invention of the transistor 1 - 2 ' 3 has given great 
stimulus to research on the interaction of holes and electrons in semicon- 
ductors. The techniques discussed in this section for investigating the 
behavior of holes in »-type germanium were devised in part to aid in 
analyzing the emitter current in transistors. The early experiments of 
Bardeen and Brattain suggested that the hole flow from the emitter to the 
collector took place in a surface layer. 1 - 2 The possibility that transistors 
could also be produced by hole flow directly through u-type material was 
proposed in connection with the p-n-p transistor. 4 Quite independently, 
J N Shive 5 obtained evidence for hole flow through the body of K-type 
germanium by making a transistor with points on opposite sides of a thin 
germanium specimen. Such hole flow is also involved in the coaxial trans- 
istor of W. E. Kock and R. L. Wallace. 6 Further evidence for hole injec- 
tion into the body of w-type germanium under conditions of high fields was 
obtained by E. J. Ryder. 7 

In keeping with these facts it is concluded 3 that with two points close 
together on a plane surface, as in the type-A transistor, 8 holes may flow 
either in a surface layer or through the body of the germanium, tor sur- 
face flow to be large, special surface treatments appear to be necessary; 
such treatments were not employed in the experiments described in this 
section and the results are consistent with the interpretation that the hole 
current from the emitter point flows in the interior. 

*This section appeared in slightly modified form as a part of; a ^ paper bj r W. Shockley, 
G L. Pearson, and J. R. Haynes in the Bell Syst. Tech. J., 28, 344-365 (1949). 
'iT Bardeen and W. H. Brattain, Phys. Rev. 74, 230-231 (1948). 

* W H Brattain and J. Bardeen, Phys. Rev. 74, 231-232 (1948). 

' J Bardeen and W. H. Brattain, Phys. Rev. 75, 1208-1225 (1949). 

* W. Shockley, Bell Syst. Tech. J., 28, 453-489 (1949). 

5 J N. Shive, Phys. Rev. 75, 689-690 (1949). 

6 W E Kock and R. L. Wallace, Electrical Engineering, 68, 222-223 (1949). 
' E.'j. Ryder and W. Shockley, Phys. Rev. 75, 310 (1949). 

8 For reviews of the type-A transistor see Bardeen and Brattain, Phys. Rev. 75, 1208-1225 
(1949V R M. Ryder, Bell Laboratories Record, 27, 89-93 (March, 1949); J. A Becker and 
Jn Shive, Electrical Engineering, 68, 215-221 (1949); and R. M. Ryder, Bell Sys, Tech. J., 

28, 367-400 (1949). 
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The experiments described in this section, in addition to any practical 
implications, serve to put the action of emitter points on a quantitative 
basis and to open up a new area of research on conduction processes in 
semiconductors. It is worth while at the outset to contrast some of the 
new aspects of these experiments with the earlier experimental status of the 
bulk properties of semiconductors. Prior to the invention of the transistor, 
inferences about the behaviors of holes and electrons were made from 
measurements of conductivity and Hall effect. For both of these effects, 
under essentially steady state conditions, measurements were made of such 
quantities as lengths, currents, voltages, and magnetic fields. The meas- 
urement of time was not involved, except indirectly in the calibration of the 
instruments. Nevertheless, on the basis of these data, definite mental 
pictures were formed of the motions of holes and electrons describing in 
particular their drift velocity in electric fields and the transverse thrust 
exerted upon them by magnetic fields. 9 The new experiments show that 
something actually does drift in the semiconductor with the predicted drift 
velocity and does behave as though it had a plus or minus charge, just as 
expected for holes and electrons. In addition, experiments described in 
Section 3.2 show that the effect of sidewise thrust by a magnetic field ac- 
tually is observed in terms of the concentration of holes and electrons near 
one side of a filament of germanium. 

We shall discuss here evidence that holes are actually introduced into 
w-type germanium by the forward current of an emitter point and show how 
the numbers and lifetimes of the holes can be inferred from the data. We 
shall refer to this important process as hole injection. Discussions of 
the reasons why an emitter should inject holes have been given for metal- 
semiconductor point contacts by Bardeen and Brattain and for p-n junc- 
tions by Shockley. These theories are discussed in subsequent sections. 
There are other possible ways in which semiconductor amplifiers can be 
made without the use of hole injection into »-type material or electron 
injection into .p-type material. 10 In this chapter, however, our remarks 
will be restricted to semiconductors which have only one type of carrier 
present in appreciable proportions under conditions of thermal equilibrium; 
for such cases the theoretical considerations are simplified and are appar- 
ently in good agreement with the experiments. 

3.1b. Measurement of Density and Current of Injected Carriers. The 
experiment in its semiquantitative form is relatively simple and is shown 

9 In the sense of Chapter 14, these pictures amount to additional physical concepts which 
enter the theory but drop out again before Hall effect and conductivity predictions are made. 
The new experiments bear on other aspects of the physical concepts and give an operational 
reality to the pictures associated with drift velocity and Hall effect. 

10 For example see J. Bardeen and W. H. Brattain, Phys. Rev. 74, 230-231 (1948), and 
W Shocifey and G. L. Pearson, Phys. Rev. 74, 232-233 (1948). 
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in Figure 3 1 " A rod of »-type germanium is subjected to a longitudinal 
electric field E applied by a battery B 1 . Collector and emitter point 
contacts are made to the germanium with the aid of a micromanipulator. 
The collector point is biased like a collector in a type-A transistor by the 
battery B 2 , and the signal obtained across the load resistor R is applied to 
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Fig. 3-1— Experiment to Investigate the Behavior of Holes 
Injected into w-Type Germanium. 

(a) Experimental arrangement. . . 

(b) Signal on oscilloscope showing delay in hole arrival at h in respect to closing i, 
at h and delay in hole departure at h in respect to opening S at h. 

the input of an oscilloscope. At time h the switch in the emitter circuit is 
closed so that a forward current, produced by the battery B a , flows through 
the emitter point. At h the switch is opened. The voltage wave at the 
collector, as observed on the oscilloscope, has the wave form shown in 
part (b) of the figure. . . . 

These data are interpreted as follows: When the emitter circuit is closed, 
the electrons in the emitter wire start to flow away from the germanium 
(that is, positive current flows into the germanium). These electrons are 
furnished by an electron flow in the germanium towards the point of con- 

» Experiments of this sort were first reported by J. R. Haynes and W. Shocldey, Phys, 
Rev. 75, 691 (1949). 
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tact. The flow in the germanium may be either by the excess electron 
process or by the hole process. In Figure 3.2 we illustrate these two possi- 
bilities. At first glance it might appear that the difference between these 
two processes is unimportant since the net result in both cases is a transfer 
of electrons from the germanium to the emitter point. There is, however, 
an important difference, one which makes several forms of transistor action 
possible. In the case of the hole process an electron is transferred from the 
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Fig. 3-2 — Comparison between Electron Flow to and Hole Flow from a Point Contact. 

valence band structure to the metal. After this the hole moves deeper into 
the germanium. As a result the electronic structure of the germanium is 
modified in the neighborhood of the emitter point by the presence of the 
injected holes. 

Under the influence of the electric field E, the injected holes drift toward 
the collector point with velocity n p E, where n P is the mobility of a hole, and 
thus traverse the distance L to the collector point in a time L/p p E. When 
they arrive at the collector point, they increase its reverse current and pro- 
duce the signal shown at t^. 

There are two important differences between the signal produced at /2 
and that produced at t\. The signal at t\, which is in a sense a pickup sig- 
nal, would be produced even if no hole injection occurred. We shall 
illustrate this by considering the case of a piece of ohmic material sub- 
stituted for the germanium. Conventional circuit theory applies to such a 
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case • however, in order to contrast this purely ohmic case with that of hole 
ntction, we shall also give a description of the conventional theory of 
i Sr -mission in terms of the motion of the carriers According to 
conventional circuit theory, the addition of the current 7 e would simply 
produc an added IR drop due to current flow in the segment of the spec, 
men to the right of the collector. This voltage drop is denoted as I A in 
part b), R d representing the proper combination of resistances to take into 
Iccoun the way in which I. divides in the two branches. This signal will 
be transmitted from the emitter to the collector with practically the speed 
o igh "he ordinary theory of signal transmission along a conductor 
being applicable to it. This high speed of transmission does not, of course 
imply a correspondingly high velocity of motion of the current earners. In 
fTct the rapidity of signal transmission has nothing to do with the speed of 
he carriers and comes about as follows: If the ohmic material is an elec- 
ron c onductor, then the withdrawal of a few electrons by the emitter 
currnt produces a local positive charge. " This positive charge produces 
an Lctric field which progresses with the speed of hght and exerts a force 
on adiac nt electrons so that they move in to neutralize the space charge. 
The net result is that electrons in all parts of the specimen start to ^dnft 
practically instantaneously. They flow into the specimen from the end 
term nals to replace the electrons flowing out at the emitter pom and no 
appreciable chLge in density of electrons occurs anywhere within the 

^TtTistinction between the process just described and that occurring 
when holes are injected into germanium is of great importance in under- 
Jtanding many effects connected with transistor action. One way of sum 
mariz ng L situation is as follows: In a sample having carriers of one type 
"ectns for example, it is impossible to alter the d^^rners 
by t ying to inject or extract carriers of the same type The reason is, as 
Ascribed above (or proved in the footnote), that such changes would be 
Sompanied by an unbalanced space charge in the sample and such an 
unbalance is self-annihilating and does not occur. 

When holes are injected into »-type germanium they also tend £ set 
up a space charge. Once more this spacecharge is qu.ckly neutralized by an 
elect o flow. In this case, however, the neutralized state is not the norma 
trmal equilibrium state. Instead the number of current carriers present 

n This is a description in words of the result ordinarily jessed fc i te ri ns of *e_d£ectnc 

'tS^r^SL & = electrostatic potential, . - dielectric constant. The 
"-rtne^o^SuS^rtri^ by surface cha.es discussed in S ection,U 

hrSecSn. 7 Such spa£ charge layers are discussed bnefly » Sect 10 n «. 
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has been increased by the injected holes and by an equal number of electrons 
drawn in to neutralize the holes. The total number of electrons in the 
specimen will thus be increased, the extra electrons coming in from the 
metal terminals which complete the circuit with the emitter point. The 
presence of the holes and the neutralizing electrons near the emitter point 
modify the conductivity. As we shall show below, this modification of 
conductivity may be so great thart it can be used to measure hole densities 
and also to give power gain in modified forms of the transistor. We shall 
summarize this situation as follows: In a semiconductor containing sub- 
stantially only one type of current carrier, it is impossible to increase the total 
carrier concentration by injecting carriers of the same type; however, such 
increases can be produced by injecting the opposite type since the space charge 
of the latter can be neutralized by an increased concentration of the type normally 
present. 

Thus we conclude that the existence of two processes of electronic conduction 
in semiconductors, corresponding respectively to positive and negative mobile 
charges, is a major feature in several forms of transistor action. 

In a vacuum-tube triode there are also two ways in which electrons 
carry current: (1) ordinary metallic conduction in the leads and (2) space 
flow produced by thermionic emission. In Section 4.2 we shall show, by 
comparing a p-n-p transistor with a vacuum-tube triode, that there is a 
close analogy in the two cases between the ways in which the interaction 
of one form of current flow with the other produces amplification. 

In terms of the description given, the experiment of Figure 3.1 is readily 
interpreted. The instantaneous rise at t\ is simply the ohmic contribution 
due to the changing total currents in the right branch when the emitter 
current starts to flow. After this, there is a time lag until the holes injected 
into the germanium drift down the specimen and arrive at the collector. 
When the current is turned off at t-s, a similar sequence of events occurs. 

The measured values of the time lag of t\ — t2, the field E, and the dis- 
tance L can be used to determine the mobility of the holes. The fact that 
holes, rather than electrons, are involved is at once evident from the polarity 
of the effect; the disturbance produced by the emitter point flows in the 
direction of E, as if it were due to positive charges ; if the electric field is re- 
versed, the signal produced at t% is entirely lacking. The values obtained 
by this means are found to be in good agreement with those predicted from 
the Hall effect and conductivity data. The Hall mobility values obtained 
on single crystal filaments of n- and p-type germanium 1 * are 

Up = 1700 cm/sec per volt/ cm (1) 

n n = 2600 cm/sec per volt/cm. (2) 

The agreement between Hall effect mobility and drift mobility, as was 
pointed out at the beginning of this section, is a very gratifyingconfirmation 
14 G. L. Pearson, Phys, Rev. 76, 179 (1949). 
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of the general theoretical picture of holes drifting in the direction of the 
electric field. 15 

We shall next consider a more quantitative embodiment of the experi- 
ment just considered. In Figure 3.3, we show the experimental arrange- 
ment. In this case it is essential in order to obtain large effects that the 
cross section of the germanium filament be small. A thin piece of germa- 
nium is cemented to a glass backing plate and is then ground to the desired 
thickness. After this the undesired portions are removed by sandblasting 
while the desired portions are protected by suitable jigs consisting of wires, 
Scotch tape, metal plates, etc. After the sandblasting, the surface of the 
germanium is etched. In this way specimens smaller than 0.01 X 0.01 cm 
in cross section have been produced. The ends of the filament are usually 
made very wide so as to simplify the problem of making contacts. 

Under experimental conditions, a battery like B u of Figure 3.1, applies a 
"sweeping" field in the filament so that any holes injected by the emitter 
current are swept along the filament from left to right. In the small fila- 
ments used for these experiments, the resulting concentration of holes is so 
high that large changes in conductivity are produced to the right of the 
emitter point and, as we shall describe below, these changes can be measured 
and the results used to determine the hole current at the emitter point. In 
order to treat this situation quantitatively, we introduce a quantity 7 de- 
fined as follows: 

7 = the fraction of the emitter current carried by holes. (3) 

Accordingly, a current yl ( of holes flows to the right from e and produces a 
hole density, denoted by p, which is neutralized by an equal added electron 
density. A fraction (1 — 7)/* of electrons flows to the left; these electrons 
do not, however, produce any increased electron density to the left of the 
emitter since they are of the sign normally present in the w-type material. 
The presence of the holes to the right in the filament increases the con- 
ductivity <t [as shown in Figure 3.3(c)] both because of their own presence and 
the presence of the added electrons drawn in to neutralize the space charge 
of the holes. The mobility of electrons is greater than the mobility of holes, 
the ratio being 16 

b = tin/pp =1.5 for germanium (4) 

and the electrons are always more numerous than the holes 17 

n = n + p, (5) 1 

16 See Chapter 12 for a review of the most recent data. 

16 G. L. Pearson, Phys. Rev. 76, 179 (1949). See Chapter 12 for the most recent data. 

17 The notation used in the equations is as follows: n, p, «o = respectively density of 
electrons, of holes, of electrons when no holes are injected. Nd and N a are the densities of 
donors and acceptors, assumed ionized so that no = Nd — N a . I t , h, h are as shown on 
Figure 3.3. (I c used for the probe collector in Figures 3.1 and 3.9 does not enter the 
equations.) 

18 See Problems at the end of this chapter. 
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Fig. 3-3 — Method of Measuring Hole Densities and Hole Currents. 

(a) Distribution of holes, electrons, and donors. Acceptors, which may be present, 
are omitted for simplicity, the excess of donor density Na over acceptor density N a 
being tin. 

(b) To the right of the emitter the added hole density p is compensated by an 
equal increase in electron density. 

(c) The conductivity is the sum of hole and electron conductivities. 

(d) The total current h + I t to the right of the emitter is carried by I p and I n 
in the ratio of the hole to the electron conductivity. 
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where n is the concentration of electrons which would be present to neu- 
tralize the donors if p were equal to zero ; consequently, the current carried 
by electrons is greater than the current carried by holes. The concentration 
of holes diminishes to the right due to the fact that holes may recombine 
with electrons as they flow along the filament. 

From this experiment the value of 7 and the lifetime of a hole in the 
filament can be determined. The measurements are made with the aid of 
the two probe points Pi and P 2 , which draw no current and serve simply to 
measure the potential 19 on the two equi-potential surfaces V x and V % . The 
conductance of the segment of filament between these equi-potentials is 
obtained by measuring the voltage difference and dividing it into the 
current I b + 1 \. The conductance of the filament bejtween these points 
is obtained by measuring the voltage difference A.V and dividing it into the 
current lb + 1 '«. The necessary formulae for calculating hole density and 
hole current, shown on the figure, are derived as follows: 

Normal conductivity <r = en n n , (6) 

conductivity with holes present <j = ey. n n + e\i v p 

= ennitio +P)+ e\h>P = ff o[l + (1 + r^Cp/wo)]. (7) 

This relationship is plotted in part (a) of Figure 3.4 as a function of p/n . 
The conductance, 

g = (/« + i b )/*r, (8) 

between Pi and P 2 is proportional to the local conductivity, and hence to 

1 + (1+ J -1 )(*>/»„), (9) 

so that a measurement of the conductance gives a measurement of p/n . 
Letting G and G be the conductances between the points with and without 
hole injection, we have 

^. = ± == i + (i + r 1 )(p/»o) (io) 

Go cq 
or 

f - a ~ ff ° — (G/Go) ~ 1 (u} 

no ~ cr (l + r 1 ) ~ 1+lr 1 

The ratio of the hole current to the total current is, of course, simply the 
ratio of the hole conductance to total conductance. This ratio, I p /(I n + Ip), 

19 When the hole and electron densities in a semiconductor do not have their equilibrium 
values, the potential measured by a point contact will depend on the nature of the contact, 
the situation being remotely analogous to that corresponding to thermoelectric potentials. 
These internal contact potential differences are usually so small that they can be neglected for 
experiments of the sort described here. For a further discussion of the theory see W. Shockley, 
Bell Syst. Tech. J. 28, 435-489 (1949). 
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is plotted in part (b) of Figure 3.4. The value of I p may then be deter- 
mined graphically as follows: From a measurement of G/G , p/n is de- 
termined from part (a) ; from p/rio, the ratio I p /(I n + I p ) is then found 
from (b). Since 7„ + I p is known, this determines I p . The algebraic 
relationship corresponding to the procedure just outlined is as follows: 

Ip eix p p 



In + Ip eix n n + en P p bn + (1 + b)p 

p/no _ 1 ~ (Gq/G) , 

^[1 + (1 +^- 1 )(p/«o)] 1+* 



(12) 




n zn 

hole density, p 




n 2n 

HOLE DENSITY, p 



Fig. 3-4 — Dependence of Conductivity and Fraction of Current Carried by Holes 
Upon Hole Concentration. (Drawn for b = 1.5.) 

Hence from the measured values of G, it is possible to obtain the fraction 
of the current carried by holes. The hole current flowing past the probe 
points can be obtained by multiplying this expression by 1 1 4- I b — I„ + 
I p ; this leads to the formula 



Ip = 



(l e + l b )[l - (G /G)] I € + h - G AF 



\+b 



\ + b 



(13) 



where AFis given by (8). This last expression can be given the following 
simple interpretation. G A^is the current carried by the normal electron 
density n . The numerator of (13) is thus the extra current carried by 
injected holes and added electrons, and of this extra current a fraction 
1/(1 + ¥) is carried by holes so that the hole current is given by (13). 20 
If there were no decay, the current past the probe points would be yl t 
and, since 7 e is known, y could be easily determined. Actually, however, 

20 In these calculations the formula n = p + no, corresponding to completely ionized 
donors and acceptors, has been used. In germanium this is a good approximation. For 
silicon, however, modifications will be necessary. 
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there may be quite an appreciable decay. However, if the current h is 
increased, the holes will be swept more rapidly from the emitter to the 
probes and less decay will result. Thus by increasing I b , the effect of 
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recombination can be minimized and the value of hole current can be 
extrapolated to the value it would have in the absence of decay. This 
value is, of course, yl e . 

In Figure 3.5 we show some plots of this sort. The ordinate is I p /I t 
which should approach 7 as the value of I b becomes larger. The theory 
indicates that a logarithmic plot should be used and that the abscissa 
should be made proportional to transit time so that the case of no decay 
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or zero transit time comes at the left edge. 21 The conclusion reached from 
this plot is that, for the case of the »-type sample, the value of 7 is sub- 
stantially unity, all the emitter current is holes. For the opposite case in 
which electrons are injected into^>-type material, 22 the corresponding value 
of I n /I ( extrapolates to 0.6 indicating that for this case 60 per cent of the 
current is carried by electrons and 40 per cent by holes. For these partic- 
ular specimens the lifetimes are found to be 0.9 and 0.41 microsecond 
respectively. There is a body of evidence, some of which we discuss below, 
that holes combine with electrons chiefly on the surface of the filament. 

3.1c. Influence of Hole Density on Point Contacts. The presence of 
holes near a collector point causes an increase in its reverse current; in 
fact the amplification in a type-A transistor is due to the modulation of the 
collector current by the holes in the emitter current. The influence of 
hole density upon collector current has been studied in connection with 
experiments similar to those of Figure 3.3. After the hole current and the 
hole density are measured, a reverse bias of 20 to 40 volts is applied. The 
reverse current is found to be a linear function of the hole density. Figure 
3.6 shows typical plots of such data. Different collector points, as shown, 
have quite different resistances. However, once data like that of Figure 
3.6 have been obtained for a given point, the currents can then be used as a 
measure of hole density. This experimental procedure for determining hole 
density is simpler than that involved in using the two points and much 
better adapted to studies of transient phenomena. It is necessary in em- 
ploying this technique to keep the current drawn by the collector point 
somewhat smaller than It + 1 1 ; otherwise the disturbance in the current 
flow due to the collector current is too great and the sample of the hole 
current is not representative. Experiments have shown, however, that this 
condition is readily achieved and that the collector current may be satis- 
factorily used as a measure of hole density. 

The hole density also affects the resistance of a point at low voltage. 
Studies of this effect have also been made in connection with the experi- 
ment of Figure 3.3. After the hole density has been determined from 
measurements of AV and lb + I ( , a small additional voltage (0.015 volts) 
was applied between Pi and P% and the current flowing externally between 
Pi and P2 was measured. From these data a differential conductance, for 
small currents, is obtained for the two points Pi and P2 in series. As is 

21 If the lifetime of a hole is t, then the hole current at the points is I p — yl e exp (— //V) 
where / is the transit time to a point midway between the points, say a distance L from 
the emitter. If the electric field is E = &V/A.L, then the transit time / = LAL//* p t\F. 
Hence if In I v> as determined from the ratio of conductivities, is plotted against / = LAL/npAF 
a straight line with intercept In yl f and slope — 1/t should be obtained. See Chapter 12. 

22 Transistors using p-type germanium have been described by W. G. Pfann and J. H. Scaff, 
Phys. Rev. 76, 459 (1949). Electron injection inp-type germanium has also been observed by 
R. Bray, Phys. Rev. 76, 152, 458 (1949). 
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shown in Figure 3.7, this conductance is seen to be a linear function of the 
hole concentration. The conductance of a point contact arises in part from 
electron flow and in part from hole flow. From experiments using mag- 
netic fields, 23 it has been estimated that under equilibrium conditions the 
two contributions to the conductance may be comparable. In connection 
with Figure 3.7 it should be noted that the hole concentration on the 
abscissa is the average hole concentration throughout the entire cross 
section; the hole concentration may be much less near the surface due to 
recombination on the surface. 
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Fig. 3-6 — Dependence of Collector I c Upon Average Hole Density Being Swept by 
Collector Point. Collector Biased 20 Volts Reverse. 



Techniques of the sort described previously can be used to measure the 
properties of collector points. If a collector point is placed between the 
emitter and Pi in Figure 3.3, then the hole current extracted by the 
collector can be determined in terms of the hole current past Pi and P 2 - 
By these means an "intrinsic a" for the collector point can be determined. 
The intrinsic a is defined as follows: 

ai = intrinsic a = the ratio of change in collector current per 
unit change in hole current actually arriving at the collector. 

3.1d. Studies of Transient Phenomena. The technique of using a 
collector point to measure hole concentrations has been employed in a 

88 H. Suhl and W. Shockley, Phys. Rev. 75. 1617 (1949); 76, 180 (1949). 
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number of experiments similar to those described in connection with Figure 
3.1. These experiments give information concerning hole lifetimes, hole 
mobilities, diffusion and conductivity modulation. 

One of the methods employed to measure hole lifetime involves the meas- 
urement of the increase in collector current, produced by the arrival of the 
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Fig. 3-7 — Conductance of Pi and P-2 of Figure 3-3 in Series as Function of p/no 
Showing that Conductance Depends on Hole Concentration but Not on Currents in 
Filament. For Each Value of I € the Hole Density Was Varied by Varying lb + I t 

from .038 to 0.78 ma. 



leading edge of the hole pulse, as a function of the transit time of the holes 
from emitter to collector. This time is varied by varying the distance be- 
tween the emitter and the collector points. 

In Figure 3.8 we show a plot, obtained in this way, from a sample of 
germanium having dimensions 1.0 X .05 X .08 cm. It is seen that the 
increase in collector current due to hole arrival decays exponentially with a 
time constant of 18 microseconds. This time constant increases as the 
dimensions of the germanium sample are increased so that a time constant 
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of 140 microseconds was measured, using a sample having dimensions 
25 X 35 X 30 cm. Since the holes injected into the interior of this 
Lple can diffuse to the surface and recombine in about IOC ^seconds, 
the process may still be largely one of surface recombination. In any 
evenHt may be concluded that the lifetime in the bulk material used must 
be a 'least 140 microseconds. Making use of the electron density deter- 




10 



20 30 40 50 60 70 80 90 100 110 120 
TRANSIT TIME IN MICROSECONDS 



Flo . 3-8-The Decay of Injected Holes in a Sample of «-Type Germanium. 

mined from other measurements, we conclude that the recombination cross 
Tet on mTst be less than 1(T" cm*. This cross section which is ess than 
/4O0 the area of a germanium atom, may be so small because a hole-elec- 
roTp ir h« Acuity in satisfying in the crystal the conditions somewhat 
analogous to conservation of energy and momentum which hinder recom 
Hnatfon of electrons and positive ions in a gas discharge. Thus it has been 
pointed out that a hole-electron pair will have a lowest energy state in 
which the two current carriers behave something like the proton and elec 
Iron of a hydrogen atom. 24 Such a bound pair are called an exam and 
the energy given up by their recombination is the exciton energy. In 
order to Combine they must radiate this energy in the form of a light 

24 G. H. Wannier, Phys. Rtv. 52, 191-197 (1937). 
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quanta (photon) or a quantum of thermal vibration of the crystal lattice 
(phonon). The recombination time for the photon recombination process 
can be estimated from the optical constants for germanium and the theory 
of radiation density using the principle of detailed balancing, which states 
that under equilibrium conditions the production of hole-electron pairs by 
photon absorption equals the rate of recombination with photon emission ; 
the lifetime obtained in this way is about 1 second at room temperature 
indicating that the photon process is unimportant. 25 As has been pointed 
out by A. W. Lawson, 26 the highest energy phonon will have insufficient 
energy to carry away the "exciton energy" of a hole-electron pair and, 
therefore, the release of energy will require the cooperation of several 
phonons with a correspondingly small transition probability. 

When a square pulse of holes is injected in an experiment like that of 
Figure 3.1, the leading and trailing edges of the current at the collector 
point are deformed for several reasons. Owing to the high local fields at 
the emitter point, some of the holes actually start their paths in the wrong 
direction — i.e. away from the collector; these lines of flow later bend for- 
ward so that those holes also pass by the collector point but with a longer 
transit time than holes which initially started towards the collector. A 
spread in transit times of this sort is probably largely responsible for the 
loss of gain at high frequencies in transistors. For the experiments de- 
scribed below, however, this effect is negligible compared to two others 
which we shall now describe. 

On top of the systematic drift of holes in the electric field, there is super- 
imposed a random spreading as a result of their thermal motion. This 
would cause a sharp pulse of holes to become spread so that after drifting for 
a time t d the hole concentration would extend over a distance proportional 
to V r Di d where D = the diffusion constant for holes = kTjx p /e = 45 cm 2 /sec. 
As a result of this effect, the leading and trailing edges of the square wave 
of emission current become spread out when they arrive at the collector. 
This is shown in Figure 3.9, curve A for the leading edge and B for the trail- 
ing edge. The points are 10 microsecond marker intervals traced from an 
oscilloscope, the time being measured from the instant at which the emitter 
current starts. For A and B the emitter current was so small compared to 
the current h that the holes produced a negligible modulation of conductiv- 
ity and each hole moved in essentially the same electric field. It is to be 
observed that the wave shapes are nearly symmetrical in time about the 
half rise point and that the A and B waves are identical except for sign. 

26 Optical constants for germanium have been published by W. H. Brattain and H. B. 
Briggs, Phys. Rev. 75, 1705-1710 (1940) and H. B. Briggs, Phys. Rev. 77, 287 (1950). The 
integration over the radiation distribution was carried out by W. van Roosbroeck. 

26 Personal communication; a somewhat similar case is treated by B. Goodman, A. W. 
Lawson, and L. I. Schiff, Phys. Rev. 71, 191-194 (1947). 
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This is just the result to be expected from diffusion. Furthermore, analysis 
shows that the spread in arrival time is in good quantitative agreement with 
the theoretical wave shape using the diffusion constant appropriate for 
holes. For this case the mid-point of the rise, corresponding to the crossing 
point of the curves, gives the average arrival time and has been used to 
obtain an accurate measure of the mobility. 

Curves C and D correspond to conditions in which the emitter current 
was relatively large— two thirds of the base current. High impedance 
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Fig. 3-9 — Collector Current Characteristics for the Circuit Shown in Figure 3-1. 



sources are used so that lb is constant and I t is a good flat topped wave. 
For the currents used in this experiment, the conductivity is appreciably 
modulated by the presence of holes. This accounts for the shape of curve 
C, corresponding to the arrival of holes at the collector. It is seen that this 
curve is not symmetrical but is much more gradual towards later times. 
The reason for this is that the first holes to arrive are those which have 
diffused somewhat ahead of the rest and move in material of low conduc- 
tivity. The later holes travel in an environment of relatively high con- 
ductivity and, consequently, in a lower electric field. (Since the current 
is the same at all points between emitter and collector, the field is inversely 
proportional to the conductivity.) The transit time for the later holes is, 
therefore, longer and the hole density builds up more slowly for the latter 
part of the incoming pulse of holes. The wave form obtained from the 
trailing edge of the emitter pulse, curve D, is in striking contrast with the 
leading edge. The first gradual decay, up to point X, is due to recombina- 
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tion of holes and electrons; at / 3 the emitter current becomes zero; con- 
sequently, the electric field is reduced and the holes arriving at X have 
taken a longer transit time than the holes arriving at t 3 and a larger fraction 
of them have recombined with electrons. The true trailing edge, running 
from X to Y, is appreciably sharper than the leading edge. The reason for 
this is that holes lagging behind the main body of holes are in a region of 
relatively low conductivity and high electric field and tend to catch up with 
the main body. Thus the same effect which lengthens wave C acts to 
shorten wave D. 

C. Herring has been able to obtain mathematical solutions for the appro- 
priate equations bearing on the matters just discussed. 27 

The delay feature discussed in connection with Figures 3.1 and 3.9 
indicates interesting possibilities of using germanium filaments as delay or 
storage elements. 

3.2 MAGNETIC CONCENTRATION OF HOLES AND 
ELECTRONS, THE SUHL EFFECT 

Because of its great importance in affording a means of investigating nole 
lifetimes and related matters, we shall discuss the magnetic concentration 
effect observed by Suhl and Shockley. 1 This experiment represents an 
extension of the Hall effect beyond its usual range of application and gives 
direct experimental evidence that holes and electrons moving in magnetic 
fields are subjected to a sidewise thrust. 

The experimental arrangement may consist of an «-type germanium 
filament which is placed in a transverse magnetic field, Figure 3.10(a). If 
an electric field E x causes an electron flow parallel to the filament, then 
a Hall voltage will appear across the specimen and, for the polarities shown, 
the top surface will be charged negatively. 2 The effect is produced, 
according to theory, by transverse force tending to thrust electrons to one 
side as they drift down the filament. For the case shown, the top surface 
of the filament will become negatively charged, and the total electric field, 
having components E x and E y , will form angle d n with the lines of current 
flow. If a few holes are injected into this field by the current I n they will 
be deflected by the magnetic field towards the same side as the electrons. 
(As discussed in Section 8.7, the sidewise thrust is proportional both to 
velocity and to charge of the particle, each of which is reversed for a hole 
compared to an electron so that the magnetic thrust is in the same direc- 
tion.) In addition, the transverse component, E y , of electric field which 

27 C. Herring, Bell Syst. Tech. J. 28, 401-127 (1949); see also Chapter 12. 

1 H. Suhl and W. Shockley, Phys. Rev. 75, 1617-1618 (1949). 

2 An elementary description of the Hall effect is given in Sections 8.6 to 8.8. Figures 8.7 
and 8.9 show important features. 
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holds electrons away from the top surface, tends to concentrate holes 
there. The net result is that holes are drawn towards the top surface. 

Experimental evidence for this predicted behavior of the holes is obtained 
with the aid of the probe points. As Figure 3.7 shows, the conductance of 
a probe point is a linear function of hole density. Consequently, the 
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Fig. 3-10 — Magnetic Concentration of Holes and Electrons. 

conductance of the points can be used as a measure of hole density. The 
observed result can be visualized with the aid of Figure 3.11, which shows 
schematically the sort of effects expected. For no magnetic field, the hole 
current spreads out across the specimen by diffusion; anticipating the 
result to be established later, we have indicated that some of the lines of 
flow terminate on the surface where a major portion of the recombination 
of holes and electrons occurs. For a magnetic field Hi, the holes are 
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strongly deflected towards the top side of the filament. For H 2 > #i> 
the effect is still more pronounced and a second effect is indicated: the hole 
current dies away more rapidly to the right. The reason for this is that 
the holes combine with electrons on the surface of the filament and the 
increased concentration near the surface for H 2 compared to H x means an 
increased opportunity for recombination and a decreased lifetime. 



EMITTER 




(C) H = H 2 >H 1 



Fig 3-11— Paths of Holes Injected into w-Type Filament Showing Concentration 
by a Magnetic Field H directed into the Plane of the Figure. 

For a suitably designed experiment the effects may be quite large. The 
angles n and 6 P between currents and electric field have values, as given 
in Chapter 8, of 

8 n = 10-Vtf = 10- 8 X 2600 X 10 4 = 0.26 

= 15° for 10,000 gauss C 1 ) 

6 P = 10~ 8 X 1700 X 10 4 = 10° for 10,000 gauss. (2) 

For a longitudinal electric field E x of 20 volts cm, the total thrust trans- 
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verse to the filament is approximately equivalent to a field of 

E x tan {B p + n ) .= 8.5 volts/cm, (3) 

giving an effective voltage difference of 0.025 X 8.5 = 0.21 volt across a 
filament 0.025 cm in width. This is about 8.5kT, so that pronounced 
concentration will result. The total distance a hole must travel from one 
side of the filament to the other is 0.025/sin 25° = 0.06 cm. Since the drift 
velocity is about 1700 X 20 = 3.4 X 10 4 cm/sec, the transit time will be 
less than 2 X 10 -6 sec. This time is probably at least one order of mag- 
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Fig. 3-12— Dependence of Conductance of Probe Point Upon Magnetic Field. 

nitude less than the time for recombination in the interior of a good ger- 
manium specimen so that the concentration should be accomplished well 
before holes can get into equilibrium in the interior. 

A typical set of experimental data is shown in Figure 3.12. The con- 
ductance of a point, such as Pi, is shown as a function of H. Positive H 
deflects holes towards Pi, and negative H away from it. With the first 
increase of H above zero, the conductance rises, indicating a higher hole 
concentration produced in the neighborhood of the probe point. Still 
further increase of //produces a higher degree of concentration of the holes, 
as shown in Figure 3.11 ; however, at the same time, the holes decay more 
rapidly so that the signal at a point like P 2 is decreased. As a result of 
these opposing tendencies, the curve rises to a maximum and then decreases. 
For negative values of //, the holes are deflected away from the point, the 
lifetime is decreased as for positive //, and a monotonic decrease of the 
response results. 

The curve shown in Figure 3.12 is theoretical and has been fitted to the 
data by using as the law of recombination: 

(Rate of recombination of holes per unit area of surface) 

= (a constant s) X (hole density just next to the surface). (4) 
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The value of s, which has the dimensions of velocity, chosen to fit the data 
is 1500 cm/sec. Other methods of analysis have given comparable values 
for similar specimens. 

The failure of the theory to fit the points for large negative values of H is 
significant. Practically the same points are obtained if no hole injection 
is used at all. The interpretation is that this part of the curve arises from 
holes being spontaneously generated on the surfaces. These holes are 
swept away from the point for negative H; however, some holes still reach 
the point and affect its conductance. For large negative H, the limiting 
value of 20 micromhos is thought to be that corresponding to a nearly 
complete absence of holes. 

By a suitable proportioning of the specimen and the fields, it is possi- 
ble to separate the effects of surface generation and volume generation. 
The mathematical details are not appropriate to Part I, however, and.are 
postponed to Chapter 12. The practical use of the concentration of 
injected holes by magnetic fields has also been considered, particularly 
by R. L. Wallace. 

The purpose of this section on magnetic concentration of holes and 
electrons has been, on the one hand, to show one more item of experimental 
evidence that holes and electrons really behave as the theory of the later 
chapters predicts and, on the other, to introduce the reader to a new and 
powerful experimental technique which the author predicts will play a 
major role in the analysis of transistor materials. 

Problems 

(Remark: Most of the problems bearing on the material of this chapter 
involve solving the continuity equation. A number of such problems are 
given at the end of Chapter 12.) 

1. This problem is intended to show why deviations from 

n = n + p 

given in equation (5) of Section 3.1 do not ordinarily occur. Suppose that 
in the interior of a large block of «-type germanium with a dielectric con- 
stant of K e = 16 and conductivity a = 0.4 ohm -1 cm -1 the value of n - p 
differs by 1% of n throughout a sphere 10~ 3 cm in radius. Calculate the 
voltage and field at the surface of the sphere. 

Ans. 0.36 volt, 36 volts/cm. 

2. Assume that Figure 3.3 applies to the specimen of problem 2 of 
Chapter 1. Suppose that the current h is 2.9 ma and I ( is 1.8 ma and 
7 = 0.5 and that recombination is negligible. Using fi p =■ 1700 and 
y. n = 3600 cm 2 /volt sec and b = 2.1 (these values are in agreement with 
the problems of Chapter 1), show that the conductivity is increased from 
0.5 ohm _1 cm -1 to 1.23 ohm -1 cm -1 . Show that the electric field is reduced 
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from the value 0.94 volts/cm it would have if y = to 0.38 volts/cm due 
to the presence of injected holes and that the resistance is reduced from 
400 ohms to 162 ohms. Show that the electron concentration is increased 
from 8.70 X 10 14 cm -3 to 17.4 X 10 14 cm -3 . (Compare these results 
with problem 5 of Chapter 1.) 

3. Consider an «-type germanium filament of length L, cross-sectional 
area A and conductivity <r = en Q u n . Suppose that an added density of 
holes p[x) is present so that there a.rep(x)Adx holes in element of length dx. 
Show that if p{x) is much smaller than n , the increase in conductance of 
the filament is 

e(nn + Vp)L~ 2 f Ap(x)dx = e()x n + n P )LT 2 p io t. 

where p to t. is the total number of added holes. (This result may be ob- 
tained by writing the resistance of the filament in the form 

R = Jo dX ^ Ae ^ W ° + P( - X ^ + ^^ 

and expanding the fraction in powers of p(x)/n , which is assumed to be a 
small quantity.) This problem proves that for a uniform filament and for 
small hole densities, the change in conductance depends only on the total 
number df added holes present and not on their distribution. 



CHAPTER 4 
ON THE PHYSICAL THEORY OF TRANSISTORS 

In this chapter, the theories of hole and electron injection discussed in 
previous chapters are used to explain the internal workings of transistors. 
The discussion starts with the filamentary transistor. This is the simplest 
type from the point of view of exposition, since its ability to amplify de- 
pends chiefly on the conductivity modulation discussed in Section 3.1. 
The discussion then proceeds to other types of transistors which employ 
rectifying junctions in their output circuits. In Section 4.2 the discussion 
of rectifying junctions is begun with a treatment of one of the simplest 
types, the p-n junction. It is then shown how a transistor may be made 
by combining such junctions. In Section 4.3, metal point-contact rectifiers 
are described and compared with p-n junctions, and in Section 4.4 the 
theory of these rectifying junctions is applied to relate the physical picture 
to the equivalent circuit for the type-A transistor. In Section 4.5 a 
description of the electrical forming of the contacts in a transistor is pre- 
sented together with a discussion of structures leading to high values for a. 
Section 4.6 discusses briefly phototransistors and counters. 

4.1 THE THEORY OF FILAMENTARY TRANSISTORS 

4.1a. The Equivalent Circuit. In Figure 4.1 we show a transistor with a 
filamentary structure. 1 Modulation is achieved in this case by injecting 
holes at the emitter point which flow to the right and modulate the resist- 
ance in the output branch between emitter and collector. Structures of 
this sort can be produced by the sand-blasting technique discussed in 
Section 3.1. The enlarged ends, which give the unit a dumbbell appear- 
ance, decrease the problem of making contact to the unit. The large area 
at the left side serves the additional purpose of reducing unwanted hole 
emission from the metal electrode and affords an opportunity for any 
emitted holes to recombine before they enter the narrow part of the unit. 

The theory of this transistor is relatively simple and most of the features 
we shall discuss in connection with it have counterparts in the theory of the 
type-A transistor. We shall discuss the case for which the injected current 
is a small fraction of the total current in the filament. Under these con- 

1 Transistors of this type, employing p-n junctions as well as point contacts as emitters, 
have been discussed by W. Shockley, G. L. Pearson, M. Sparks, and W. H. Brattain, Phys. 
Rev. 76, 459 (1949). This section follows W. Shockley, G. L. Pearson, and J. R. Haynes, 
Bell Sy'st. Tech. J. 28, 344-366 (1949). 
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ditions we can use a simple linear theory. We shall show that the behavior 
of the transistor can be given for small a-c signals by the equivalent circuit 
in Figure 4.1, which shows the current and voltage relationships in a form 
equivalent to those used in connection with the type-A transistor. We 
shall carry out the analysis assuming that V b = 0, the grounded base condi- 
tion This procedure simplifies the algebra involved in deriving the equiva- 
lent circuit by eliminating one variable; the equivalent circuit itself is, ot 
course, applicable to cases in which the base is not grounded. 
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(b) EQUIVALENT CIRCUIT 
Fig. 4-1— Filamentary Transistor and Equivalent Circuit. 

The point / in Figure 4.1 represents a point in the filament near the 
emitter point The current from the emitter point will be determined by 
the difference between its voltage V t and that of the surrounding semi- 
conductor, namely the voltage at /. Thus we can write 

For small a-c variations, i„ v t and vj, this equation leads to the rela- 
tionship 

f , = (*. - vj)f t ', O 

where// is the derivative of/, in respect to its argument. Letting /.' = l/r«, 
we may rewrite this equation as 

v t - vj = r t i t . ( 3 ) 
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This relationship is correctly represented by the r ( branch of the equivalent 
circuit. The voltage at /, under the assumed operating conditions with 7. 
positive and much less that I c , will be —I b rb where r b is the resistance 
from the base to an imaginary equipotential surface passing through /. 
Since v b = 0, corresponding to grounded base operation, this leads to 

vj = —r b ib = +r b i ( + r b i c , (4) 

since i b + i t + i c = 0. This relationship is obviously satisfied by the r b 
branch of the equivalent circuit. 

We now come to the collector branch which we have represented as a 
resistance r c and a parallel current generator 2 a e i ( ; the value of r c corre- 
sponds to the case of no hole injection. (This circuit is equivalent to 
another in which the parallel current generator is replaced by a series 
voltage generator a e r c i f .) We must show that this part of the equivalent 
circuit represents correctly the effect of injecting holes into the right arm 
of the filament. We shall suppose that there is negligible recombination so 
that the hole current injected at the emitter point flows through the 
entire filament. (We consider recombination in the next section.) The 
current I c in the collector branch thus contains a component — yl t = I P 
of hole current [minus because of the algebraic convention that positive 
/ c (= — I b —I e ) flows to the left]. The added hole and electron con- 
centrations lower the resistance and r c changes to r c + 5r c , where 8r c is 
negative. The current voltage relationship for this branch of the filament 
then becomes 

V c -Vj= (r c + 8r c )I c . (5) 

Our problem is to reexpress this relationship in terms of the small a-c com- 
ponents and show that it reduces to the relationship 

v c — vj = r c (i c + a e i ( ) (6) 

corresponding to the equivalent circuit. For small emitter current the 
analysis is carried out conveniently as follows: The ratio of hole current to 
the total current is -yIJI c . The ratio (r c + 5r c )/r c corresponds to 
G /G discussed in connection with Figure 3.3. The ratio of hole current to 
total current is given in (12) of Section 3.1 in terms of G /G and may be 
rewritten as 

_ lh = 1 ~ ( Ga/G) = ~ 8rc (7) 

/. 1 + * <l + i)r,' 

giving 

5r c = r e (l + b)iIJI e . (8) 

2 The subscript e in a t implies equivalent circuit, a, differs from a = — (dl e /dl e ), e by the 
relationship a e = a + (a - l)(n/r,), equivalent to equation (24). 
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(Since I c is negative and 1 \ is positive this equation shows that Sr c is nega- 
tive, that is, the conductivity has been increased by the hole current.) 
Putting this value of r c + 8r c into the equation for V c — Vj gives 

V c -Vj= (r c + 8r c )I c 

= r c [I c + (1 + b)yl ( ]. (9) 

If we consider small a-c variations in the currents and voltages, this reduces 
to the equation given by the equivalent circuit with 

<*e = (1 + b)y. (10) 

The data 3 of Section 3.1 indicate that for holes injected into »-type ger- 
manium 7 = 1, and for b = 1.5 we obtain a e = 2.5. 

Using vj = niit + i c ) we eliminate vj from (3) obtaining an equation 
between v e and the currents. Similarly the small signal form of (9) gives 
an equation for v c : 

Of = (r ( + r b )i f + r b i e (11a) 

t>c = {n + oc e r c )i f + (r c + r b )i c . (lib) 

These equations are formally identical with those for the equivalent circuits 
of the type-A transistor and, therefore, lead to the equivalent circuit of 
Figure 4.1, which is identical with that of Figure 2.7. 

It should be emphasized that although hole injection into »-type germa- 
nium plays a role in both the type-A and the particular form of filamentary 
transistor shown in Figure 4.1, there are differences in the principles of 
operation. One important feature of the type-A is the high impedance of 
the rectifying collector contact which, however, does not impede hole flow 
and another important feature is the current amplification occurring at the 
collector contact. Neither of these features is present in the filamentary 
type shown. Instead, the high impedance at the collector terminal arises 
from the small cross-section of the filament. The modulation of the output 
current takes place through the change in body conductivity due to the 
presence of the added holes, a change which appears to be unimportant in 
the type-A transistor. In the filamentary type, current amplification is 
produced by the extra electrons whose presence is required to neutralize 
the space charge of the holes. Current amplification in the type-A trans- 
istor is, probably, also produced by the space charge of the holes 4 but the 
details of the mechanism are not as easily understood. We shall discuss 
theories of forming and of high a's in Section 4.5. 

4.1b. The Origin of the Positive Feedback and Instability. The fila- 
mentary transistor exhibits the same positive feedback as the Type-A and 

3 See Chapter 12 for the best estimates. 

4 J. Bardeen and W. H. Brattain, Phys. Rev. 75, 1208-1225 (1949). 
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for much the same reason. Because of its simple structure, however, the 
description is somewhat more straightforward for the filamentary type. 

We shall describe in words some of the phenomena associated with the 
positive feedback, next attempt a physical description of the processes 
involved, and finally derive the results in terms of the equations. 

Suppose the collector is maintained at a fixed d-c bias V cy say —20 volts. 
Next suppose that the emitter current is gradually increased from zero to 
a larger value. It is found, as discussed in connection with the static 
characteristics of Figure 2.6, that the emitter voltage rises to a maximum 
and then falls. In other words, the emitter connection exhibits a nonlinear 
response and a negative resistance ; although the emitter voltage is uniquely 
determined by the emitter current, the reverse is not true and, for a given 
value of emitter voltage, there is more than one possible value for the 
emitter current. This gives rise to possible instability if the transistor 
input is operated from a low-impedance voltage source, and a transient 
may cause a transition from a low-current condition to a high-current 
condition. With external circuit parameters properly chosen to limit the 
currents, both the high- and low-current conditions may be within the 
allowable dissipations. Such circuit arrangements may be employed in 
"flip-flop" and counting circuits. However, if the external resistors were 
chosen as discussed in Section 2.2 so as to give maximum available gam, 
then, although the circuit might be stable at the operating point, this 
stability might be conditional and a large transient could cause a transition 
to the high-current condition with possible destruction of the unit. 

In order to explain the particular behavior of V t as a function of I e 
shown in the static characteristics of Figure 2.6, we must show why the 
characteristic is nonlinear with a resistance which drops from positive to 
negative values. The explanation thus involves two parts, one having to 
do with the negative resistance and one with the nonlinearity. The 
negative resistance arises from the presence of positive feedback in the 
equivalent circuit coupled with values of a e greater than unity, and the 
nonlinearity arises chiefly from the varying resistance of the rectifying 
emitter contact. We shall discuss the negative resistance first, basing the 
description on the physical picture of current flow previously described. 
The negative resistance feature may be understood by considering the 
value of Vj as I, is increased. (For purposes of this exposition, which 
is intended to give the physical picture involved without the burden of 
mathematical refinements, we shall assume that y = 1 so that the hole 
current is I ( and shall neglect recombination so that a e equals b + 1 and 
is independent of the currents.) If V e remained constant so that the 
current through r c in the equivalent circuit of Figure 4.1 also remained 
constant, then the effect of I, would be to increase I c by -aj t . Thus, 
in addition to the injected hole current I \, which flows in the right branch 
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in part (a) of Figure 4.1, there will be an added electron current (a e — 1)1 ( 
due to electrons moving to the left. These added electrons must flow out 
through r b increasing the current from its initial value with 1 1 = by 
an amount (<*<. — l)/ e . This requires an added voltage across r b of 
r b (a e — 1)1 „ the sign being such as to make Vj change in the negative 
direction. Thus the injection of a positive current into the germanium at 
e makes the adjacent point go not more positive, as it would for the case 
of a positive resistance, but negative instead. If the drop across r e were 
negligible, this would mean that e would exhibit a negative resistance. 

However, r e is the differential resistance, d(V \ — Vj)/dl of a rectifying 
point operated in the forward direction. Thus it initially exhibits a large 
positive resistance, which decreases as I t is increased. For small I e , the 
positive resistance r t is much larger than the negative resistance (a e — \)r b ; 
however, for large values of I € , r ( may be less than (a e — \)r b , and the 
input resistance at e will shift from positive to negative, leading to the 
multivalued behavior discussed previously. In this treatment, the effect 
of changing Vj upon the current through r c was neglected. If r b >C r c , 
this, in effect, introduces an unimportant correction as we shall next show 
by dealing algebraically with the equations of the equivalent circuit. 

We shall now assume that all the parameters in the equivalent circuit are 
constant except r ( and shall calculate the differential resistance vji t when 
V c is maintained at constant bias. The constancy of V c requires that 
v c = and hence that 

v c = (r b + a e r c )i ( + (r e + r b )i c = (12) 

and, hence, that 

OeTc + n . 

tc= ; *«. (13) 

r c + n 

Using this value of i e in equation (11a) for v e gives 

i i ^ i ( a efc + r b \ 
\ U + r b } 

This leads to the differential resistance 

V< OieVtfc + n 2 1 

- = r ( + r b ■ = r t - (a e - l)r&— — • (15) 

i ( r c + r b 1 + r b /r c 

This equation shows that the input resistance v t /i t can be negative. If 
rb ^ r c , then the input resistance is negative when r ( < (a e — \)r b , the 
condition previously discussed. 5 

6 J. Bardeen and W. H. Brattain have previously derived essentially the same equation. 
Their expression is written as af'r b = 1, where/' = 1/Vn so that this is equivalent to afi/ru = 1- 
From the definition of a and the formula for a e given in Figure 2.7, it can be shown that the 
two conditions are equivalent. I am informed by R. M. Ryder that from the circuit point of 
view, all of these conditions are equivalent to m/3 = 1 in feedback terminology. 



(14) 
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4.1c. Effects Associated with Transit Time. Two important effects 
arise from the fact that a finite transit time is required for holes to traverse 
the r c side of the filament: during this time the holes recombine with elec- 
trons and the modulation effect is attenuated for this reason; also the 
modulation of the conductivity of the filament at any instant is the result 
of the emitter current over a previous interval and for this reason there 
will be a loss of modulation when the period of the a-c signal is comparable 
with the transit time or less. In accordance with the results given in the 
problems of Chapter 3, for small injected hole densities the change in 
conductance of the active branch L c of the filamentary transistor of 
Figure 4.1 is dependent on the total number of added holes and not on 
their distribution along the filament. 

In the problem at the end of this chapter, a treatment based on the total 
number of holes present is given. In this section, the result is obtained by 
reasoning by analogy using the results of Section 4.1a. 

For the small signal theory, the effect of transit time is readily worked out 
in analytic terms. We shall give a derivation based on the assumption that 
the lifetime of a hole before it combines with an electron is r p . According 
to this assumption, the fraction of the holes injected at instant h which are 
still uncombined at time t 2 is exp [-(/ 2 - h)/r v \. This means that the 
effect in the filament at any instant h is the average, weighted by this 
factor, of all the contributions prior to h back to time h — r t where r t is 
the transit time; holes injected prior to t 2 - rt have passed out of the 
filament by time t 2 . If the emitter current is represented by i l0 e tat , the 
effective average emitter current is 



*eeff 



(h) = *\o f " exp Mi - (/ 2 - h)/r t ]dti/T t . (16) 

Jto-n 



The term dti/n is chosen so that a true average is obtained since the sum 
of all the dh intervals add up to r t . The integral is readily evaluated and 
gives 

The result so far as the equivalent circuit is concerned is that obtained by 
taking a e as 6 

cc e = 7(1 + W, ( 18 ) 

6 The derivation of Equations flla) and (lib), describing the equivalent circuit, shows 
that hole injection enters only through the term SrJ c in (9). This term leads only to afcU = 
(1 +t)yr c i t in (lib) and should be replaced by (1 +6)y c i eea = (1 + i)ypr e i e leading 
to (18). 
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= 



1 — exp [-/lOTj — (Tt/Tp)] 
lUTt + (jth V ) 



[4.1c 



(19) 



(3 represents the effect of recombination and transit angle, wt ( , in reducing 

the gain. 

We shall consider two limiting cases of this expression. First if ur t is 
very small, the new factor becomes 



= (t„/t,)(1 



-ri/rp 



(20) 



If t ( is much larger than t p , so that the holes recombine before traversing 
the filament, then the exponential is negligible and /S becomes simply T p /r t . 
This means that the effectiveness of the holes is reduced by the ratio of their 
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effective distance of travel to the entire length of the filament, that is, 
T p /Tt is the ratio of distance traveled in one lifetime to the entire length of 
the filament. Essentially the holes modulate only the fraction of the 
filament which they penetrate. The transit time depends on the field in 
the filament which is \V C - Fj\/L c , the absolute value being used since V c 
is negative. The transit time is thus 

r t = L e /\» P \V e - Vj\/L c ] = L C 2 /» P \F C - Vj\. (21) 

For very small emitter currents V c - Vj = r c V c /{r c + r b ) so that 



r t = L c 2 {r c + r b )jv. v r c \V c 



(22) 



and T t is inversely proportional to V c . For large values of V c , r t approaches 
zero and /3 approaches unity. The dependence of /3 upon V c has been inves- 
tigated by measuring a and plotting it as a function of \l/V c \ as shown in 
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Figure 4.2. The value of 

« = - (a/ e /a/.) v c = - («c/i ,) v c (23) 

is readily found from the equivalent circuit, using equation (lib), to be 



n + _afc_ _ (24) 



n + r c r b + r c 

For one particular structure investigated, the values of r b and r c , obtained 
at I e = 0, were in the ratio 1 :4. The value of a obtained by extrapolating 
the data to \V C \ = » is 2.2; the value given by the formula for this case 
with /? = 1 and b = 1.5 is 

a = 0.2 + 0.8 X 2.5 X t, (25) 

from which we find y = 1.0, in agreement with the result of Figure 3.5 of 
Section 3.1 that substantially all of the emitter current is carried by holes. 7 
The theoretical curve shown on the figure is 

a = 0.2 + 0.8 X 2.5 X I*V10|(1 - e 10/lM ). (26) 



This corresponds to 

r t 10 L c 2 (r c + r b ) 



(27) 



t p \V C \ T p n p r c \F c \ 

from which it was concluded that for the particular bridge studied r v was 
0.2 microsecond. 

If t { is much shorter than t p , then the holes penetrate the whole filament 
and becomes 

a _ 1 ~ e x P ( — *W<) _ exp (-icoTf/2) sin (a>T t /2) _ . 

tUT f («Tj/2) 

For small values of wr t , /3 approaches unity since (sin #)/# approaches unity 
as x approaches zero. For cot ( /2 = ir, the response is zero. This is the 
condition that r t = 2tt/co = 1//. For this case the filament is just so long 
that the modulation is averaged over the time of one cycle of the input 
signal and since this average includes all phases, the modulation vanishes. 
Preliminary experiments with filamentary transistors, made in accord- 
ance with the principles discussed above, appear to confirm the general 
aspects of the theory. 8 Power gains of 15 db have been obtained and fre- 
quency responses showing a drop of 3 db in a at 10 6 cycles/sec have been 
observed. Noise measurements indicate an improvement of 10 to 15 db 
over the average type-A transistor for comparable conditions of preparation. 

7 This depends on the value of 1.5 for b which is based on the "Hall mobility." See 
Section 12.9. 

8 W. Shockley, G. L. Pearson, J. R. Haynes, Bell Syst. Tech. J., 28, 344-366 (1949). 
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4.2 p-n JUNCTIONS AND p-n JUNCTION TRANSISTORS 
4 2a The Nature of the Current in p-n Junctions. 1 The physical theory 
of rectifying junctions in transistors may be more simply described for 
1-n Junctions than for metal point contacts, and a discussion of the latter is, 
therefore, postponed until Section 4.3. Both collectors and emitters may 
be made from p-n junctions and it is possible to show how emitter junctions 
may be designed to have high efficiency for injecting carriers of the sign 
not normally present in appreciable numbers. 

The p-n junctions which we shall discuss occur when a piece of germanium 
or silicon has a variable concentration of donor and acceptor centers so that 
a transition from ^-type to K-type occurs in a continuous solid specimen. 
If two separate pieces of germanium of opposite conductivity types are 
simply placed in contact, however, layers of oxide or other material on the 
surface, surface states, roughness, etc., in general prevent a true p-n junc- 
tion from being formed. 2 

Good p-n junctions may be formed in many ways. They occur naturally 
in melts of relatively pure silicon because of segregation of the impurities 
upon solidification. 3 They have been produced in germanium by convert- 
ing one part of a piece of »-type material to ^-type either by nuclear bom- 
bardment or by heating. 4 , 

In Figure 4.3 we show an idealized p-n junction. Part (a) shows 
pictorially the distribution of donors, acceptors, holes, and electrons; 
(b) shows the densities of donors and acceptors; and (c) shows the densities 
of holes and electrons. These densities adjust themselves under the 
thermal equilibrium situation represented so that there is no current either 
of holes or electrons and so that the recombination of holes and electrons 
in any small element of volume just balances the thermal rate of production 
of holes and electrons. Since the holes are more concentrated in the jp-type 

i p.„ junctions were investigated before the war at Bell Telephone Laboratories by R S. 

Ohl Work on p-n junctions in germanium has been pubhshed by the group at Purdue 

S ected by K. Lark-Horovitz: S. Benzer, Phy, Re, 72, 1267 (1947); M Becker and H. Y. 

vTThysRe, 75, 1631 (1949); and H. Y. Fan, Phy, Re, 75, 1631 (1949). War Junc- 

rions occu in lead sulfide according to L. Sosnowski, J. Starkiewicz, and O. S.mpsor, Natar* 

59 818 (1947); L. Sosnowski, Phy, Rev. 72, 641 (1947); and L. Sosnowsk,, B W. Soole, 

and T Starkiewicz Nature 160, 471 (1947). The theory described here has been discussed m 

onne'ction wX photoelectric 'effects in p-n junctions by F. S. Goucher, Meeting of the 

Americ n Physical Society, Cleveland, March 10-12, 1949 and by W. Shockley, G. L. Pearson 

jM« a I« PAv t Rev 76 180 (1949). For a general review of conductivity in p- and 

:^;^e^L.XarIo 6 ; 1 and ( J. Baldeen, Phy, Re, 75, 865 (1949); J H. Scaff H. C 

The^rer and E. E. Schumacher, /. of Metals 185, 383 (1949); and W. G. Pfann and J. H. 

Icaff,/. <>/ Metals 185, 389 (1949). The latter two papers also di cuss ^vcjuc barners. 

* For a review of these matters see J. Bardeen, Phy, R /%\\' J 1 '^ ^l, (M9) . 

3 See T H Scaff, H. C. Theuerer, and E. E. Schumacher, /. of Metals 185, 383-388 (1949) , 

^A W G Pfann and T. H. Scaff, /. of Metals 185, 389-392 (1949). 

3 * U. Becker and RY. Fan, Phys.Rev. 75, 1631 (1949); W. Shockley, G. L. Pearson, and 
M. Sparks, Phy, Rev. 76, 180 (1949). 
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(a) Distribution of donors, acceptors, holes, and electrons 

(b) and (c) Densities. 

(d) Net charge density producing a dipole layer. 

(e) Electrostatic potential corresponding to dipole layer. 
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material, they tend to diffuse to the «-region. This tendency is prevented 
by an electrostatic field shown in (e). This field makes the electrostatic 
potential more positive in the w-type material so that the »-region is thus 
able to keep holes out and to hold electrons in. A similar situation holds 
true for electrons. In part (d) the net charge density is shown (neglecting 
any electrons trapped on donors or holes on acceptors, a legitimate assump- 
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Fig. 4—4 — Distribution of Holes and Electrons and Energy as a Function of Position 
in a p-n Junction under Applied Biases. 

tion for germanium, see Chapter 10). This charge density produces a 
dipole layer which in turn produces the potential difference shown in (e). 
The determination of the potential distribution under equilibrium condi- 
tions is, in principle, a straightforward application of the equilibrium 
theory discussed in Chapters 10 and 12. 5 (The potential rise is then found 
to be an inevitable consequence of the requirement that the Fermi level be 
constant throughout the specimen; when the appropriate analysis is 
carried out, the situation illustrated in Figure 4.3 is deduced.) 

We shall next consider the nature of current flow across the junction 
when voltages are applied. The situations of interest are shown in the 
first two rows of Figure 4.4 ; the bottom row will be discussed later. Under 

6 See W. Shockley, Bell Syst. Tech. J. 28, 435-489 (1949) for mathematical details covering 
the same ground as this section. 
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equilibrium conditions some of the holes shown in (d) will acquire enough 
energy by thermal agitation to climb the potential rise and diffuse into the 
w-region. Once in this region they will combine with electrons. This 
current is exactly balanced (according to the principle of detailed balancing, 
see Chapter 11) by holes thermally generated in the w-region, as members 
of hole electron pairs, which diffuse to the transition region and slide down 
the potential hill into the p-region. In a similar way electron currents flow 
and also exactly halance each other. 

Under conditions of reverse bias, a negative potential is applied to the 
p-region so that the height of the potential hill is increased, part (f). For 
biases of few tenths of a volt or more, the hill is so high that a negligible 
number of holes acquire sufficient thermal energy to climb it, and the hole 
current into the w-region substantially vanishes. The hole current/row the 
//-region, however, is practically the same as for (d) ; in part (f) the origin 
of this current is represented as the thermal generation of a hole electron 
pair followed by the diffusion of the hole to the transition region and its 
subsequent transit to the p-region. This hole current, as remarked pre- 
viously, is almost unaffected by the applied potential. For this reason the 
reverse hole current saturates, that is, reaches a limiting value as the 
reverse voltage increases. Precisely similar comments apply to the elec- 
tron flow, and it is also evident that the same hill which holds holes in the 
p-region holds electrons in the w-region ; that is, reverse voltage is reverse 
for both hole and electron currents. 

In the forward direction, the hill is diminished in height and, therefore, 
a relatively large hole current flows over it and into the w-region. Some of 
these holes diffuse back, the fraction being determined by the lifetime of a 
hole in the w-region. Mathematical theory can be applied to the currents 
involved, and the results can be expressed in terms of a current voltage 
relationship, which we shall quote below. 

Also shown in Figure 4.4 are the "energy band" diagrams appropriate to 
this situation. Although the basic theory of energy bands is not developed 
until the end of Chapter 5, the aspects of the theory needed in connection 
with Figure 4.4 are sufficiently simple so that they can be described directly 
from the figure. As we have discussed, an electron can be removed from 
the valence band and set in motion in the conduction band. Such an 
electron is in an excited state (like the car on the second floor) and its 
energy is higher than it would be in a valence bond. These conduction 
band levels are represented by the shaded region in the upper parts of the 
bottom row of Figure 4.4. A similar situation holds true for the levels in 
the valence-bond energy band. These lower levels correspond to holes, 
which are simply unoccupied levels for electrons. The donors and ac- 
ceptors are represented as plus and minus charges and are located at ener- 
gies corresponding to their abilities to bind holes and electrons. For the 



90 ON THE PHYSICAL THEORY OF TRANSISTORS [4.2a 

case of interest in this section, we shall suppose that binding occurs to a 
negligible extent so that we need not be concerned with these energies. 
(A further discussion will be found in Chapter 9.) The tendency of 
electrons to seek the lowest energy is equivalent to holes seeking the highest 
electronic energy level. Since it is quantum-mechanically impossible to 
have a half-excited electron (that is, there is no mezzanine or ramp in the 
garage), there is a gap of energy levels between the valence-bond states and 
the conduction states. The creation of hole-electron pairs by thermal 
excitation corresponds to the raising of an electron across the energy gap as 
shown in part (i). Kinetic energy of motion for the electron carries it to 
energies higher in the conduction band, and kinetic energy of motion of a 
hole carries it lower. These topics are discussed on the basis of the quan- 
tum theory of energy bands at the end of Chapter 7 (see Figure 7.6, for 
example). Since, furthermore, the charge on the electron has, unfortu- 
nately, been chosen as negative, the energy level diagram for electrons is 
upside down compared to the electrostatic potential. 

The net hole current across the junction can be derived for the model ot 
Figure 4.4 and is found to be given by the following equation, familiar in 
rectifier theory: 6 

I P = Ip. [exp (cV/kT) - 1] (1) 

where V is the voltage (positive values of V being forward) applied across 
the junction itself, e is the electronic charge, and kT is thermal energy. 
When V is expressed in volts and T is 300°K, corresponding to room tem- 
perature, this equation becomes 

J, = /,. [exp (39*0 - 1] 

= / P .,(10 17y - 1). ( 2 ) 

The value of /„, depends only on the »-type material and not on the 
p-type. The value of I ps is 

/,, = epnDp/VDpVp (3) 

where p n is the concentration of holes in the »-region, r p the lifetime of a 
hole in the «-region, and Mp and D p the mobility and diffusion constant for 
holes. (D p = n P kT/e = 1700/39 = 44 cm 2 /sec as shown in Chapter 12.) 
For a semiconductor in equilibrium, a mass action law [see Section 10.3, 
equation (19)] applies to the product of hole and electron concentrations. 

• W. Shockley, "Theory of p-n Junctions", Bell Syst. Tech. J. 28, «£489 (1949). .This 
equation was first published by C. Wagner, Phys. Zeits. 32, 641-645 1931), for a different 
model and represents the maximum nonlinearity attainable for purely electronic rectification. 
Equations (1) and (3) are derived in Chapter 12. 
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Hence, if the w-type material has a high concentration of electrons, it has 
a small concentration of holes and I ps will be small. 

Similar equations apply to the flow of electrons. Forward flow for 
electrons corresponds to the same polarity as for holes so that the hole and 
electron currents are those of two rectifiers 
in parallel. From the reasoning already 
given, we see that if the concentration of 
holes in the ^-region is much greater than 
the concentration of electrons in the n- 
region, then I ns will be less than I ps and 
the current will flow across the junction 
mainly as holes. 

There are a number of ways of obtaining 
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Fig. 4-5 — Unsymmetrical 
p-n Junction Showing How 
More Current (Forward is 
Shown) Is Carried by 
More Abundant Type of 
Carrier (Holes). 

this last result and it may be 
helpful to consider a different 
one. In Figure 4.5 we show 
a junction having concentra- 
tion of holes in the p-region 
greater than that of electrons 
in the w-region. For forward 
biases both holes and electrons 
must climb over the same 
potential energy hill. If there 
are many more holes than elec- 
trons, many more of them will 
climb over the hill and most 
of the current will flow in the 
form of holes. Such a junc- 
tion will obviously be a good 
emitter of holes. 

In Figure 4.6 we compare the theoretical curve with experimental data 
for a p-n junction. The fit is seen to be quite good except at high reverse 
fields where secondary effects reminiscent of field emission set in. It should 
be remarked that a 10 or 20% variation of e/kT from the theoretical value 
of 39 volts -1 will materially impair the fit of the data. This fact is a 
verification that the effective charge on the current carriers is e, the charge 
of the free electron. 
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Fig. 4-6 — Theoretical Rectification Curve and 
Experimental Data for a. p-n Junction. 
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4.2b. A p-n Junction Transistor. In Figure 4.7 we show an idealized 
transistor structure using two p-n junctions, which separate the two p-type 
regions P t and P c from the »-type region N b . Compared to the Type-A, 
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Fig. 4-7 — A p-n-p Transistor Compared with a Vacuum-tube Triode. 

this transistor has the expositional advantage that a detailed mental picture 
can be formed of the distribution of all the atoms involved. An instructive 
analogy can be drawn between this structure and a vacuum-tube triode, 
shown to the right. 

In part (g) of the figure, the equilibrium concentration of holes and 
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electrons is indicated together with the electrostatic potential (c). Holes, 
as suggested schematically in the diagram, tend to seek points of lowest 
electrostatic potential; electrons, having a minus charge, seek the highest 
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points. The energy band diagram is shown in (e). In (b) voltage in 
the reverse direction is applied across /„ and a small forward voltage 
across J t . If the />-type regions have much higher conductivity than 
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the ??-type region, most of the current across the junctions will be in 
the form of holes. Furthermore, if the w-type region Nb is so narrow that 
a hole can diffuse across it with a small chance of recombining, the holes in 
the current across J € will also flow across J c . 7 If the electron currents 
across the junctions are negligible and recombination in Nb is neglected, 
this transistor will draw no base current. If the emitter is grounded, a-c 
voltages applied to the base will require negligible currents (at least at low 
frequencies) and the behavior of the transistor will be like that of the 
vacuum tube on the right, except that the mobile carriers are positive 
rather than negative. 

An interesting additional comparison between the transistor and the 
vacuum tube may be based on the ways in which electrons carry current. 
In fact it appears that in both cases current can be carried by electrons in 
two ways which remain substantially separate and it is the control of one form 
of current by the other that enables amplification to occur. We shall first dis- 
cuss the two ways for the vacuum tube and then compare them with the 
hole and electron processes for a transistor. 

In the vacuum tube there are high concentrations of electrons in the 
cathode, the grid, and the plate as shown in parts (o) and (p) of Figure 3.7. 
There is a much smaller, but important, concentration of thermionically 
emitted electrons just in front of the cathode. When operating voltages 
are applied, the potential energy diagram for an electron will be as shown 
in (n). At the grid wires themselves the potential energy is high since 
the grid is negative in respect to the cathode. Between the grid wires the 
potential energy is relatively low, and the electrons flow through these 
passes between the peaks. If electrons are withdrawn from the grid wires, 
so as to reduce its negative charge, the potential energy will become lower 
in the grid region, and a larger current will flow. In the control region we 
thus find one electron current controlling another: electrons which flow in 
and out of the grid wires control electrons flowing in the space. The two 
currents do not become mixed because (1) the grid is negative with respect 
to the cathode so that thermionic electrons cannot reach it and (2) electrons 
in the grid wires cannot escape because they are held in by the work func- 
tion of the grid. 8 

In the transistor structure the application of voltage between emitter 
and base has an effect similar to the application of voltage between grid 
and cathode in the vacuum tube. The transition region corresponds to the 
grid cathode spacing and has a capacitance analogous to the grid cathode 
capacitance. For example, if a negative potential is applied to Nb, the 

7 Theory indicates that a negligible number of holes will be generated in the steep field 
region of J c for voltages at which satisfactory operation may be obtained. 

8 If the grid is overheated, thermionic emission occurs; we are not interested in such an 
unfavorable case, however. 
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flow of electrons into ]V& will charge this capacitance (and the collector 
junction capacitance as well) and will reduce the height of the potential 
hill over which the holes must drift to reach the collector junction. Thus 
for the transistor there is a region in which electron flow by one means, 
that is, excess electrons, controls flow by another means, that is, holes. 
However, in this case there will always be some recombination, and separa- 
tion of the current to the same degree as in a vacuum tube will be difficult 
to attain. 

It is evident that for the p-n-p transistor structure two quantities 7 and j8 
can be introduced as for the filamentary transistor. Since saturation 
current is being drawn across J c , we would expect that the change in col- 
lector current will be simply a = j3y times the change in the emitter current. 
The resistances of J \ and J c will replace r t and r c of the last section. The 
value of rb will be more complicated and will be an average of the resistance 
from the contact on the w-type material to the junction J%. From the 
approximate power gain formula, a 2 r c /r f we see that, since r c is .much 
greater than r t for the applied voltage condition shown, there will be large 
power gain, provided most of the holes can diffuse across the »-type 
materi al so that a is not too small. If a is nearly unity, the a-c base current 
will be very small. In this event the transistor can operate efficiently, in a 
manner similar to a grounded cathode vacuum tube, with the emitter 
grounded and the input signal applied to the base. 

In Section 4.5 we consider modifications of the p-n-p structure which will 
lead to collectors with high values of a. 

4.3 ON THE NATURE OF METAL SEMICONDUCTOR 

CONTACTS 

The type-A and coaxial transistors employ metal contacts for their 
emitter and collector points. These contacts, like the p-n junctions just 
described, may carry their current either as holes or as electrons, depending 
on the circumstances. In this section we shall endeavor to describe on an 
elementary level the phenomena occurring at metal semiconductor con- 
tacts. In order to describe these matters in any detail, it is advantageous 
to make use of the techniques of Fermi-Dirac statistics and energy bands. 
For this reason, we shall give only a superficial discussion here; the reader 
is referred to Chapter 12 in which a mathematical formulation of the statis- 
tics of "space charge layers" is presented. 1 

The theory of rectification in germanium differs from the .conventional 
theory by taking into account both hole and electron currents. In this 
section we shall lay particular stress on the hole emission aspect of the 
forward current in K-type germanium and will also indicate in a preliminary 

1 See also Torrey and Whitraer, Crystal Rectifiers, McGraw-Hill Book Co., New York, 1948, 
for a general survey of point contact rectifiers. 
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way how current amplification may occur at the collector. We shall 
return to both these topics, using somewhat more advanced concepts, in 
Section 4.5 where forming and the theory of high values for a are discussed. 

The term emitter was first introduced by Bardeen and Brattain to 
emphasize the role of the emitter contact in supplying a hole current for 
the collector; at that time the evidence indicated that the important hole 
flow occurred in a surface layer. The phrase hole injection was introduced 
to describe the penetration of holes into the body of the germanium. 
Hole injection was proposed in connection with the theory of p-n junctions 
and the p-n-p transistor and was independently observed by J. N. Shive 
in connection with the double surface transistor. 2 

In order to explain how hole injection can occur when the emitter is a 
metal point contact, we must consider the role of the surface states dis- 
cussed with Figure 2.3. We shall first describe what occurs when the 
point contact approaches and finally touches the surface. In order to 
simplify the discussion, we shall suppose that the metal point is electrically 
connected to the semiconductor. Then between the point and the semi- 
conductor there will be an electric field arising from the contact difference 
in potential. As the surfaces approach each other, this field becomes more 
intense, the induced charge on the semiconductor surface arising largely 
from a change in the number of electrons in the surface states. The 
tendency of the surface states to stabilize the surface potential is so great 
that only a very small change in potential in the interior of the semicon- 
ductor is produced by the contact-difference-in-potential field even when the 
metal is only one or two atomic diameters away from the surface. Fur- 
thermore, it is quite probable that the germanium surface is covered to 
a depth of one or two atoms with a layer of oxide or adsorbed atoms. 
Consequently, even after the metal is brought into mechanical contact 
with the semiconductor, the potential just inside the semiconductor surface 
will be much the same as it was before contact was established. Further- 
more, it is the potential just inside the surface that determines the rectifying 
properties of the contact, and the argument just presented shows that this 
potential is only slightly affected by putting the metal contact on the 
surface. 3 This argument has been used by Bardeen to explain why the 
rectifying properties of a germanium surface are independent of the work 
function of the metal used for the point contact. 4 

Although the potential at the surface of the semiconductor is independent 

2 For further historical details see Section 3.1, and J. Bardeen and W. H. Brattain, Phys. 
Rev. 75, 1208-1225 (1949). 

8 These matters are discussed by J. Bardeen, Phys. Rev. 71, 717-727 (1947), particularly 
equation (23). 

4 For additional data as germanium rectifiers see S. Benzer, /. App. Phys. 20, 804-815 
(1949). References. 
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of the work function of the metal which makes contact, the application of 
voltage to the point contact produces equal changes in the potential inside the 
semiconductor. This apparently contradictory result is explained as 
follows: Under thermal equilibrium conditions, the electrons in the surface 
states adjust their numbers so as to produce the fixed potential at the 
semiconductor surface by exchanging electrons with the interior of the 
semiconductor and also with the metal; the end result, however, like all 
equilibrium situations, does not depend on the nature of the mechanism 
which permits it to be established. When a potential is applied to the 
metal contact, on the other hand, the situation is not an equilibrium one. 
Accordingly, the surface states acquire a charge which represents a com- 
promise between attempts to get into equilibrium by electron exchange 
with the metal on the one side and with the interior of the semiconductor 
on the other side. If the metal is not in contact, so that electron exchange 
with it is prevented, then the situation of Figure 2.3 prevails and the sur- 
face gets into equilibrium with the interior and stays at the fixed potential 
in respect to the semiconductor. When the metal is in contact, however, 
the surface states exchange electrons very easily with the metal and 
maintain the same potential in respect to the metal as they would under 
equilibrium conditions. Thus the surface of the semiconductor comes to 
a potential in respect to the metal which is independent of the work func- 
tion of the metal; and this difference in potential then remains fixed so 
that the surface of the semiconductor follows the potential changes applied 
to the metal. 

The theory proposed by Bardeen and Brattain for hole injection by 
emitter points is based on the role of the surface states. According to 
these ideas a large density of electrons is normally present in the surface 
states. As a result electrons are repelled from the surface as shown in the 
Surface States Diagram, Figure 2.3. The effect is supposed to be still 
more pronounced than shown in Figure 2.3, however, and repulsion from 
the region near the surface occurs not only for excess electrons (in the con- 
duction band) but for some of the electrons in the valence bands as well. 
Accordingly a concentration of holes, represented in part (a) of Figure 4.8, 
is present immediately next to the metal contact. The situation is thus 
in many ways similar to the p-n junctions described in the previous section. 
If the junction of Figure 4.8 is biased in the forward direction, as shown in 
(b), a major portion of the current may be carried by holes. Similar 
considerations to those described in the last section have been used by 
Bardeen and Brattain as criteria for a good hole emitter. In essence, if 
the hole concentration near the surface is much higher than the electron 
concentration in the interior, then most of the current will be carried by 
holes. We shall return to a further discussion of this criterion in connec- 
tion with the theory of "forming" proposed by Bardeen and Pfann. 
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The reverse-current situation is represented in part (c). For this case 
the applied potential is in the same direction as the potential produced by 
the surface states and electrons are driven still farther from the point of 
contact. The reverse current is composed in part of electrons which gain 
sufficient energy thermally in the metal to get into the conduction band and 
then flow into the semiconductor. This excitation process is similar to the 
process of thermionic emission from a hot metal filament. However, the 
work function for escape into the semiconductor is so much lower than 
that of a metal that appreciable currents flow in the semiconductor at room 
temperature. Another portion of the current consists of holes which are 



CONTACT 
,METAl/ SEMICONDUCTOR 



EXCESS ELECTRONS 




HOLES 

. -s 


DONORS J 


P 


+ > 


®_ ©I 




+ 


- + 




+ ® + 


©_,_ © / 




+ +. — 


— ( 




+® r 


© © + ] 


1 


#+ ©_] 


z2z. 


+ ~" 


__ J3— — ' 




(a) EQUILIBRIUM 



(b) FORWARD 



■« 1 + 

(c) REVERSE 



Fig. 4-8 — Transistor Contact Diagram Showing the Nature of Rectification as It 

Occurs in a Transistor. 



formed thermally in the interior or on adjacent portions of the surface of 
the semiconductor. These holes are thus generated in much the same 
way and play much the same role as they do in a p-n junction. There is, 
however, one very important difference. Since the emission of electrons 
from the metal is limited by the work function they must surmount, the 
electron emission will be enhanced by any lowering of the work function. 
A hole flowing towards the metal attracts electrons and makes it easier for 
them to escape. Thus the presence of holes promotes the electron flow. 
We shall return to a further discussion of this process, which is one of the 
ways of explaining why a, can be > 1, in the section on forming. 

Rectification occurs in the metal semiconductor contact of Figure 4.8 for 
much the same reasons as in the p-n junction and we need not repeat the 
arguments in detail here. The chief difference is that the recombination 
and generation process taking place in the p-region of the p-n junction is 
here replaced by the emission and absorption of electrons over the potential 
barrier of the metal and by a similar process for holes. 

In a practical rectifier, it is usually necessary to consider the voltage 
drop in other parts of the unit as well as in the rectifying junction itself. 
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In a point contact rectifier this additional resistance is referred to as the 
spreading resistance. This may be visualized with the aid of Figure 4.9. 
The barrier layer introduces a resistance between the metal point and the 
semiconductor. The resistance of this layer is the junction resistance Rj. 
In series with it is the spreading resistance Rs of the body of the semi- 
conductor through which the current must flow in order to reach the metal 
base. The latter contact is usually so large that its resistance, even if it 
does rectify, is negligible; in general, the base contact is designed so as to 
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(b) * 
Fig. 4-9 — The Equivalent Circuit of a Point Contact Rectifier. 

reduce its nonlinearity. Most of the resistance Rs arises from the imme- 
diate neighborhood of the emitter point where the lines of flow are con- 
stricted; the value of Rs is thus a function of the area of contact. If the 
contact is regarded as circular a disc of radius a, then the spreading resist- 
ance is given by the formula 

Rs = ~ (l) 

4<r« 

where a is conductivity of the material. If the disc is taken as a hemi- 
sphere, the formula is Rs = l/lirva. 

According to "one current" theories of rectification, which do not allow 
for carrier injection, Rs is a constant. Rs can be determined experi- 
mentally by applying large forward voltages so that Rj becomes small; 
the limiting resistance then becomes Rs- It was found 5 that the limiting 
resistance in the forward direction for germanium was much smaller than 

6 R. Bray, K. Lark-Horovitz, and R. N. Smith, Phys. Rev. 72, 530 (1947). It was pro- 
posed by these authors that this effect was due to a change in conductivity in high fields and 
this effect was further investigated by R. Bray, Phys. Rev. 74, 1218 (1948). The high field 
effect was shown to be due to hole injection from the terminals by E. J. Ryder and W. Shock- 
ley, Phys. Rev. 75, 310 (1949), and this conclusion is further substantiated by the experiments 
described in Section 3.1. It has also been proposed that the low spreading resistance may be 
due to layer of surface conductivity of reversed conductivity type: J. Bardeen and W. H. 
Brattain, Phys. Rev. 74, 230, 231 (1948). 
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could be explained by the basis of the values of a and a. The explana- 
tion in most cases is that hole injection in w-type germanium increases 
the conductivity in Rs so that for forward currents both Rj and Rs are 
decreased. 

An additional effect of hole injection in w-type germanium rectifiers 
results from the high density of holes present while a large forward current 
flows: If the applied voltage is suddenly reversed, these holes are with- 
drawn by the point and produce a current pulse. For some applications, 
this effect can be large enough to cause damage to the unit. 6 

For purposes of the transistor theory, we have placed major emphasis 
upon the behavior of the two forms of current in the rectifying contact. 
There are, however, other features of theoretical interest which have been 
developed in connection with the "one current" theory of rectifiers. 
Except for the hole current, which is vital for our discussion, these theories 
describe most of the features shown in Figure 4.8. These theories are 
associated chiefly with the names of N. F. Mott 7 and W. Schottky. 8 
According to Schottky's theory, interesting information about the space 
charge layer, which is depleted of electrons, can be obtained by measuring 
the complex impedance of the junction and interpreting it as a resistance 
and capacitance in parallel. This capacitance can be shown to be that of 
a layer of dielectric having the thickness of the space charge layer. As is 
shown in Figure 4.8, the thickness of this layer varies with applied voltage, 
being wider at reverse voltages and narrower at forward voltages. Meas- 
urements of the capacitance as a function of applied voltage have been 
reported by a number of workers and the results are generally in excellent 
agreement with Schottky's predictions. 9 More recently Bardeen has 
analyzed the frequency effects in more detail and has defined the limits 
under which Schottky's approximate equivalent circuit should be valid. 10 

It is possible to show that a simple interpretation of the voltage drop 
across the barrier layer can be made in terms of the local conductivity. 
For this purpose, the layer is divided into a set of thin parallel sub-layers. 
These have varying electron densities and thus varying resistivities, and 

6 An effect of this sort has been observed by L. A. Meacham and S. E. Michaels, Phys. 
Rev. 78, 175-176 (1950) and has been called the "enhancement" effect by M. C. Waltz and 
R. R. Blair. 

7 N. F. Mott, Proc. Roy. Soc. London 171A, 27-38 (1939); also N. F. Mott and R. W. 
Gurney, Electronic Processes in Ionic Crystals, Oxford University Press, 1940. 

8 W. Schottky and E. Spenke, Wiss. Verbff. aus die Siemens Werken 18 (3), 1-67 (1939); 
W. Schottky, Zeits. fur Physik 113, 367-414 (1939); 118, 539-592 (1942); F. Rose and 
E. Spenke, Zeits. fur Physik 126, 632-641 (1949). For a simple summary of Schottky's theory, 
see J. Joffe, Electrical Communication, 22, 217-225 (1945). 

9 See the references to Schottky's papers already cited and Torrey and Whitmer, Crystal 
Rectifiers, McGraw-Hill, New York, 1948, and a paper compiled by S. J. Angello, "Semi- 
conductor Rectifiers", Electrical Engineering 68 (10), 865-872 (1949). 

10 J. Bardeen, Bell Syst. Tech. J. 28, 428-434 (1949). 
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the total resistance of the barrier layer (denned as applied voltage divided 
by current) can be shown to be simply the sum of the individual resistances 
of the sub-layers. The total width of the barrier and the resistances of 
the sub-layers both increase with reverse voltage and decrease with for- 
ward voltage, thus accounting for the dependence of resistance upon cur- 
rent in the barrier layer. This picture can be put into quantitative form 
and shown to be essentially equivalent to other approaches. 

There is, however, a serious pitfall in the method just described; once 
the idea of separating the junction into parallel layers has been considered, 
there is a strong temptation to deal with each layer as a resistance and 
capacitance in parallel and then to try to synthesize the junction impedance 
by combining these parallel combinations in series. This proposal can be 
shown to be entirely incorrect; the arguments, however, are not simple and 
we refer the reader to the recent analysis of Bardeen from which it may be 
seen that the correct solution bears no resemblance to such a series synthesis. 

4.4 THEORY OF THE TYPE-A TRANSISTOR 

4.4a. Relationship Between the Physical Picture and the Equivalent 
Circuit. We shall now combine several ideas, some of which were developed 
for simpler cases, in order to obtain a description of processes in the type-A 
transistor and a derivation of its equivalent circuit. The ideas of chief 
importance are: 

(1) Hole injection by the emitter point; 

(2) Current multiplication at the collector point with an intrinsic 
alpha of a; ; 

(3) The relationship of the filamentary transistor to its equivalent 
circuit. 

We shall modify (3) in a suitable way so that it applies to the geometry of 
the type-A transistor. As a result of this procedure we shall obtain a 
physical picture of current flow in the type-A transistor together with an 
interpretation of its equivalent circuit in terms of more fundamental 
mechanisms. For this purpose we shall refer to Figure 4.10 which illus- 
trates the current flow in this case. 

The numerical values on Figure 4.10 are in approximate agreement with 
typical data for a type-A transistor as represented in the sets of static 
characteristics of Figure 2.6 except for the set of emitter voltages given in 
parenthesis. These latter correspond, as is discussed in more detail later, 
to increasing r i2 in the equivalent circuit by 100 ohms so that a negative 
input resistance is developed. 

There are two principal differences between this structure and the 
filamentary type: (1) The high collector resistance in this case is highly 
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localized and arises from the junction resistance at the collector; the 
major portion of the transit time for a hole is spent in a relatively low 
resistance part of the unit between the emitter and collector. (2) The 
electron paths from the collector are not restrained by the geometry and 
flow over a wide range of directions. 
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Fig. 4-10 — Transistor Currents Diagram. 

The figure has been drawn as if all of the emitter current consisted of 
holes and all of these holes reached the collector. Actually we should 
introduce factors y and /3: 

7 = fraction of the emitter current carried by holes (for a £-type 

transistor, the fraction carried by electrons) ; 
j8 = fraction of holes (or electrons for p-type) leaving emitter which 
arrive at the collector. 

For ease of reference we shall also repeat the definitions of the various forms 
of a for a w-type transistor (for p-type transistor holes and electrons are 
interchanged): 

a = -(dl c /dl e ) with V c = constant; 
a e = alpha for the current or voltage generators in the equivalent 

circuit of Figure 2.7 (subscript e for equivalent circuit). 
on = "intrinsic a" = increase in collector current per unit increase of 
hole current to the collector point at constant collector voltage; 
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1 — aj = evidently, increase electron current from oollector per unit 
increase in hole current. 

In terms of f) and 7, the hole current reaching the collector will be 

Ipc = -07/.. (1) 

The minus sign results from the sign convention that a positive current 
corresponds to current into the unit. If each of these holes provokes the 
emission of (a, — 1) extra electrons, an increase of collector current 
amounting to 

A/c = -0t/« - (a,- - 1)|87/. = -aifofl, 

= -a/. (2) 

will be produced by the emitter current. Thus the quantity a is really 
the composite result of several effects and is given by the equation 

a = a#y. (3) 

The total collector current will thus be 

h = /co(^c) - CJ. (4) 

where I c o{V c ) is the function of V c which gives the reverse characteristic 
of the collector point in the absence of emitter current. This equation 
thus gives one relationship among the three parameters V c , I c , I e , which 
together with V \ are used to describe the static characteristics. 

In order to derive an equation involving V t , we use methods similar to 
those for the emitter point of the filamentary transistor, Section 4.1. 
For that case we found that, according to equations (1) and (4), 

I t =f t {V t -Vj) (5) 

where Vj was the potential in the germanium just below the emitter 
point and was given by 

Vj = (/. - 7 e )*6. (6) 

For the type-A transistor, R b is a less clearly defined concept. As Figure 
4.10 indicates, when 1 1 is zero, the emitter will float at a potential cor- 
responding to the equipotential which passes through it. For Figure 4.10, 
this equipotential is separated from the base by 0.25 volt/0.8 ma = 
312 = 300 ohms. When emitter current flows, the conductivity in its 
neighborhood is modulated and the current paths distorted. 

An exact treatment of this situation is very difficult and we shall accord- 
ingly proceed by introducing the simplifying assumption of additivity oj 
potentials. This additivity assumption would, of course, be exactly ful- 
filled if the conductivity modulation were negligible. For constant con- 
ductivity, the potential, which we shall again call Vj, immediately below 
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the emitter point would be obtained simply by superposing the potentials 
due to the currents I c and I e individually. According to our previous 
definition of R b , the collector current will produce a contribution RtJ c . 
Similarly I t will produce a potential R b 'l e . The spreading of I c over a 
large area equipotential surface by the time it passes the emitter point will 
cause R b to be less than R b '. In terms of R b and R b ' and the additivity 
assumption we may write 

/. =/.(^« - rj) =/«C. - R*i* ~ We). (V) 

This gives a relationship between F ( and the two currents I e and I c and, 
together with the equation for I c , completes the set of two equations 
required for the static characteristics. 

The quantity J i(V f — V j) is not readily measurable since Vj cannot 
be directly determined. It is, however, straightforward to measure the 
current voltage characteristic of the emitter contact by itself. This leads 
to the relationship 

/. = /.oC.) = fkv* - Rt'Uir.)] (8) 

for the case of zero collector current. It is an elementary problem in 
functional relationships to show that from these two last equations we 
must have 1 

i t =f*(r,-Rbie). (9) 

This last equation was derived more directly by Bardeen and Brattain 
by the following argument. If I t =/ e o(^«) when I c = 0, then when 
I c ?* an effective bias of +R b I c is applied. To obtain a given emitter 
current it is thus necessary to add sufficient extra voltage to overcome the 
bias, and thus V t is replaced by V t - R b I c . This argument is equivalent 
to the one given previously and both depend on the assumption of additivity 
for the voltages due to I e and I c . We have presented the more detailed 
argument to show more completely how the additivity assumption enters 
the theory. 

The additivity assumption will be a good approximation even if consid- 
erable conductivity modulation occurs. This modulation will have as its 
principal effect a modification of the forward current characteristic of the 
emitter point, and this effect will be most important in the immediate 
neighborhood of the emitter point where the spreading resistance, discussed 
in Section 4.3, chiefly arises. If the collector does not disturb the modula- 
tion in this region greatly, then the contribution o{I ( to the emitter voltage 
will be largely independent of collector current. Furthermore, if the 
modulation of conductivity takes place over a limited region near the 

1 The first equation states that there is a functional relationship between I t and V t — Rbl, 
writing this as I t = %{V t - Rtl c ), we see that, for I c = 0, we must have g{V,) =/«o(^ «)• 
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emitter, the value of r& will not be greatly altered. If these conditions hold, 
the additivity assumption will thus be well fulfilled. The additivity 
assumption is actually found to be adequate only over limited ranges of 
the currents, and n, is found for one example to be about 1000 ohms for 
I t = 0.1 ma and about 400 ohms for I t between 0.2 and 0.8 ma. 2 For the 
purpose of showing semiquantitatively the relationship of the physical 
picture to the equivalent circuit, however, the additivity assumption is 
satisfactory. 

In terms of the functions already described the equations for the equiva- 
lent circuit may be readily derived as follows: The collector resistance r 22 
is obtained from 

(10) 



(ID 



(12) 

which corresponds to the right-hand branch of the equivalent circuit of 
Figure 2.7(a). From the formula for emitter current, 

/, =/*C. -/Wc), (13) 



h = /co(fc) 


— al f 


by finding the differential of both sides. 


We thus obtain 


1 
ic = — Vc — al t , 

^22 


1 dI C Q 

r 22 dV c 


leading to 




Vc = 00-22* f + ?Wc 




= r 2 ii t + r 22 i c 


(r 21 s= ar 22), 



obta 



leading to 



l n . 1 

U = — v ( /„ — 

ru r n r n 



_5^ e J/ f =con 



(14) 



v< = r n i f + r 12 i c (r l2 = n) (15) 



which corresponds to the left side of Figure 2.7(a). 

These last equations show the relationship of the physical model of 
Figure 4.10 to the equivalent circuit. The description of the equivalent 
circuit has been given in some detail and will not be repeated here. We 
shall, however, review the question of instability and negative input 
impedance. The decrease in input impedance with increasing current was 
shown in Section 4.1 to be due to decreasing resistance in the input contact 
with increasing currents. This same effect occurs in the type-A transistor. 
The positive feedback is again associated with having a greater than unity 
and with the presence of a resistance r&. If the collector is connected to a 
zero impedance voltage supply, a negative resistance arb appears in the 

3 R. M. Ryder, Bell Laboratories Record 27, 89-93 (1949). 
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input circuit in addition to r n . On Figure 4.10 we show the effect of 
increasing r b by 100 ohms while leaving / e0 unaltered. The result is 
to change V t by 100/ c or by -0.08, -0.16, and -0.28 volt for the cases 
shown, the new values being given in parentheses. This increase in r b 
is sufficient to give the unit a negative input impedance in the range 
I ( = 0.4 to 1.0 ma as shown in going from (b) to (c) of Figure 4.10. 

The theory of the double surface or coaxial transistor is, except for the 
change in geometry, the same as that just described. 

4.4b. Transit Time Effects. Transit time plays a very different role in 
the filamentary and the type-A transistors. The high impedance region of 
the output of the type-A transistor is the barrier layer which is about 
10 -3 cm thick and has a voltage drop of 25 volts. For such a layer the 
transit time would be negligible compared to effects of interest in transistors. 
If all holes followed paths of equal length from emitter to collector and took 
the same time, the current generator in the collector circuit would be a 
faithful but delayed replica of the input signal. If, however, holes travel 
in unequal paths, as shown in Figure 4.10, then the hole current at the 
collector will represent an average of the emitter currents at various pre- 
vious times. If the important spread of times is comparable to one cycle 
of an a-c input signal, there will be a great deal of cancellation and the gain 
will be reduced. _ 

In Figure 4.11(a) we indicate a means of estimating the transit time and 
its spread. The current leaving the emitter is divided into several parts 
each bounded by flow lines or surfaces. In order to deal with a simple 
approximation, we neglect diffusion effects and modulation of the conduc- 
tivity. Under these conditions a simple relationship exists between the 
volume through which the current flows and the average transit time. This 
is illustrated in (b). A current / flows through a tube bounded by flow 
surfaces and having an area A (I) where / is length along the tube. The 
lines of flow in this case are readily computed from electrostatic theory. 3 
The field E and drift velocity v at position / are obtained as follows: 

cAE = I, E = II a Ay v = v.E = y.I/cA. (16) 

The length of time required to traverse a distance dl is then dt where 

dt = dl/v = aAdl/y.1 = (p/vI)AVi (17) 

where dVj is an element of volume of the tube. From this it is evident that 
the transit time is simply 

/ = cVx/y.1 (18) 

where Vi is the volume used by the current /. 

8 See Jeans, Mathematical Theory of Electricity and Magnetism, Cambridge at the Univer- 
sity Press, 1927, Figure 19. 
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It is evident that, for the lines of flow of Figure 4.11, there will be great 
variations in transit time. As a rough approximation we shall take the 
spread of transit times as being comparable with the transit time for elec- 
trons in the group 1%. This group will contain about half the emitter 
current. From the figure the volume of its tube, which has the line from 
e to c as a symmetry axis, is estimated as that of a cylinder S units long, and 
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Fig. 4-1 1 — Spread in Emitter Current Paths and Geometry of a Tube of Flow. 

about 0.6S units in average radius over half a circular cross section. Its 
volume would thus be 



Vi = - (0.6 SfS 



S 3 /2. 



The transit time would then be 



If we insert the typical values: 

rv-3 



cS 3 
2fJ 2 



S 3 






I t = Iff"" 3 amp, ju = 1700 cm 2 /sec, a = 0.2 (ohm-cm) 
S = S X 10~ 3 cm (= 2 mils), 
we obtain 

0.2 X 125 X 10~ 9 



-l 



1700 X 10~ 3 



= 0.015 microsecond. 



(19) 

(20) 

(21a) 
(21b) 

(22) 



If, as supposed, the transit time spread is comparable to this, the output 
would be nearly zero at 60 mc/sec and the three db point would be about at 
something less than half this frequency or 30 mc. 

This crude calculation, which was given only to illustrate the method, 
predicts a cutoff frequency, about four times higher than that observed 
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for a spacing of 2 mils, which corresponds to S = 5 X 10~ 3 cm. The dis- 
crepancy is probably due in considerable measure to conductivity modula- 
tion near the emitter point, which in effect will raise a and increase the 
transit time. 

It is evident from this treatment that situations which constrict the 
current paths between emitter and collector will tend to decrease the 
transit time. One example of this sort is furnished by effect of applying 
a magnetic field to a transistor in such a direction as to deflect holes which 
start downwards in Figure 4.11 towards the collector. This will tend to 
constrict the tubes of flow and to reduce transit time. 4 

4.5 ON THE THEORY OF "FORMING" AND HIGH 
VALUES OF a 

It was observed by Brattain in some of the earliest transistor experiments 
that the performance of the transistor could be greatly improved by passing 
large reverse currents through the collector point. Since that time a 
considerable art has developed in connection with forming, and a number of 
techniques for applying controlled currents and voltages for various lengths 
of time have been used. We shall not attempt to survey these but will 
restrict the discussion to (a) some general observations on the effect of 
current polarity and conductivity type on the forming process and to (b) 
some speculations about the nature of collectors which have high values, 
say greater than 5, for a. 

4.5a. The Forming Process. The forming treatments considered by 
Bardeen and Pfann 1 involve applications of direct current in either the 
forward or reverse direction for times of the order of seconds or less. The 
extent of the changes caused by forming depends in part on the surface 
treatment of the germanium and in part on the material used for the point 
contact. For their investigations, Bardeen and Pfann used phosphor 
bronze points on surfaces which had been ground and etched. They 
found that the effect of forming can be systematized in tabular form as 
shown on p. 109. 

As Bardeen and Pfann point out, this array of data can be correlated 
with the aid of one assumption about the effect of forming current: 

The forming current leads to a change in height of the potential barrier 
opposite to the change imposed during forming by the applied voltage. In 
other words, if the point contact is negative during forming, so that the 
potential barrier over which an electron must climb in going from the 
semiconductor to the metal is increased, then after the forming voltage is 
removed the barrier will be altered so as to be lower than its preformed 

4 This effect was reported by C. B. Brown at the June 1949 IRE Conference on Electron 
Devices at Princeton University, and in Phys. Rev. 76, 1736-1737 (1949). 
1 J. Bardeen and W. G. Pfann, Phys. Rev. 77, 401-402 (19^ 
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Table 4.1 Effect of Forming Treatment 



Unformed 



Formed with point + 
(+ current) 



Formed with point 
(— current) 





W-TYPE 




(c) 
Forward Current {point +) : 

Mostly holes, good emitter 
Reverse Current {point — ): 

Small, good rectifier 


(d) 
Forward Current {point +) : 

Little change, or larger 
Reverse Current {point — ): 

Smaller 


(e) 
Forward Current {point +) 

Hole current smaller 
Reverse Current {point — ) 

Increased; collector 

formed 



p-TYPE 



(f) 

Forward Current {point —): 
Small, few electrons; poor 
emitter 

Reverse Current {point +): 
Large, poor rectifier 



(g) 
Forward Current {point — ) 
Like unformed 

Reverse Current {point +) : 
Increased 



(h) 

Forward Current {point — ) : 
Electron current in- 
creased, emitter formed 

Reverse Current {point — ): 
Large 



value. In Figure 4.12 this effect of forming is illustrated. The six cases 
shown correspond to those of Table 4.1, and for each of these the effect of 
forming is to alter the potential barrier in a sense opposite to that of the 
applied forming voltage, which is also shown in the figure. 

(c) The normal contact to w-type germanium is supposed to produce a 
barrier layer with a rise in potential tp 3 , which is determined largely by the 
surface states and very little by the nature of the metal contact. It is also 
supposed that the barrier is not entirely uniform and is higher at some 
points. At these points, the hole concentration is especially large and 
when the junction is operated in the forward direction, the current is 
carried chiefly by holes which flow into the germanium from these regions 
of high concentration. 

(f) Since the potential of the surface states is the same for both w-type 
and />-type, the distribution of potential will be altered from (c) as shown 
in (f) for />-type germanium. The low potential barrier for holes leads to 
easy flow of holes and poor rectification and small electron emission. This 
is in agreement with the known poor rectification usually found for con- 
tacts with />-type germanium. 

In (a) and (b) the fields present during forming are shown. The result- 
ing forming effects can be interpreted by assuming that under forming 
conditions the temperature and field are so high that motion of the impur- 
ities results. The motion will then produce the changes in concentration 
and potentials shown below (a) and (b), the result being that <p s will tend 
to be changed in a direction opposite to the applied potential. 
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(e) The desired change for K-type germanium is to cause an increase in 
collector reverse current. This permits it to set up an electric field suf- 
ficiently large so that it can efficiently collect the injected holes. 
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Fig. 4-12 — Interpretation of Forming by Bardeen and Pfann. 



(h) In p-type germanium the reverse current is large enough without 
forming, but the emitter current ordinarily contains too small a fraction of 
electrons. By lowering <p s as shown in (h), regions of high electron con- 
centration are formed in front of the metal point and these can act as good 
electron emitters. 

4.5b. Interpretation of High Values of a. The hole current collected in 
a type-A transistor causes an enhanced electron flow from the collector, as 
js evidenced by the fact that a is greater than unity. It was mentioned in 
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Section 4.3 that a values greater than one can be explained by the space 
charge due to holes as they approach the collector point. This space 
charge will have an effect similar to the positive ions of Figure 4.12(e), 
whose concentration directly in front of the collector contact tends to lower 
the work function for electron emission and thus to increase the reverse 
current. 

This theory of a was proposed by Bardeen and Brattain in their first 
article. 2 As we shall discuss below, it seems possible to explain a values as 
high as 3 by this means. Values of a much larger than 3 have been ob- 
served, however, and in order to explain these it is necessary to make major 
revisions of the way in which the space charge of the holes is effective in 
increasing electron emission. After discussing the theoretical limit of the 
simple space charge theory, we shall describe the other theories. 

We shall now show that the maximum value for a based on the simple 
space charge theory has the same value (1 + b) for a type-A transistor 
that it has for a filamentary transistor and for much the same reasons. Let 
us suppose that an increment of hole current Al p produces by its space 
charge an increment of electron current Al n . Now consider what the total 
space charge situation would be if Al n = bAl P , corresponding to a = 1 + b. 
We shall show that Al n cannot be this large. It is evident that Al n will 
produce a space charge p„, which tends to cancel p p due to Al p ; evidently 
p n cannot quite cancel p p because, if it did, there would be no net positive 
space charge and hence no enhanced electron emission. However, if 
Al n = bAl p , then p n will just equal p p since, for equal charge densities, 
electron and hole currents would be in the ratio of b : 1. From this we 
conclude that the hole current cannot produce an electron current as great 
as bAl p and hence that a, or more precisely a,-, cannot be as great as 1 + b. 
This argument depends upon three assumptions, tacitly made in the pre- 
ceding discussion: (1) The ratio of mobilities is not affected by the high 
fields near the collector point; (2) holes and electrons flow along the same 
current paths (which may not be true if the junction is "patchy"); and 
(3) holes have no trouble entering the metal and thus do not accumulate 
in front of the metal on their way in. Although these assumptions are 
probably not exactly fulfilled, it is unlikely that they introduce any sig- 
nificant errors in the argument. 3 

It has also been proposed that the current multiplication may be due to 
the generation of secondaries by fast-moving holes in the intense field near 
the collector. There is a large amount of evidence against this theory. 
In some cases high values of a have been observed at such low voltages, 
say 5 volts, that secondary generation seems quite unreasonable. Further- 
more, if secondary generation were the important process, we should expect 

2 J. Bardeen and W. H. Brattain, Phys. Rev. 7S, 1208-1225 (1949). 

3 Similar reasoning has been presented by L. P. Hunter, Phys. Rev., 77, 558 (1950). 
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a to increase rapidly with applied voltage; in fact, this process would be 
similar to avalanche production in dielectric breakdown and the value of 
a would be expected to vary exponentially with voltage. This has not 
been observed, and the data are much more consistent with the idea that 
a; is independent of applied voltage, as if each hole provoked the emission 
of a fixed number of electrons regardless of the value of the collector voltage. 
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Fig. 4-13 — The "p-n Hook" as a Current Multiplier. 

For these reasons, we shall reject the theory of secondary generation and 
proceed to other theories based on the space charge of the holes. 4 

The space charge theories which we shall propose have the common 
feature that in each case the holes are impeded in their progress to the metal 
contact so that their space charge accumulates, thus having an enhanced 
effect on the electrons. 

In Figure 4.13 a transistor having a current multiplying collector region 
is shown. This transistor differs from that of Figure 4.7 in having a "multi- 
plying" region of «-type material, denoted by N m , directly behind the 
p-type region, P c . The region P c is supposed to be so thin that electrons 
entering it from N m have a good chance of diffusing through it before com- 
bining with a hole. Owing to the operation of the emitter, a hole current 

* W. Shockley, Phys. Rev. 78, 294-295 (1950). 
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is injected across /„ through Nb and across J c . These holes tend to be- 
come trapped in P c because of the "hook" in potential which is produced 
by the junction J m . We next consider the current flowing across J m . 
If N m is much more w-type than P c is ^>-type, then most of the current 
across J m will be carried by electrons as discussed in Section 4.2. In 
addition, the small width of P c permits electrons to escape to the left in 
Figure 4.13 without recombining; this also tends to reduce diffusion back 
to N m and hence further to enhance the electron current. As a result the 
electron current flowing across P c to the base will be much larger than the 
hole current flowing across J m to N m . Consequently J m acts as a current 
multiplying junction and causes the hole current arriving at the collector 
to produce a greatly enhanced electron emission. This explanation of high 
a is referred to as the p-n hook theory. 

It may be noted that a high degree of symmetry exists between the right 
and left sides of Figure 4.13, except for the reversal of sign. This sym- 
metry suggests another way of viewing the role of the hook region. This 
region is substantially the same as an electron emitter N m and a p-tyye 
base P c . Hole arrival at P c biases this electron emitter forward and pro- 
duces electron injection into P c and thence to J c , which plays the role of a 
collector junction in both cases. 

The theory of the hook current multiplier is mathematically very 
similar to the theory of the p-n-p transistor for which the basic mathe- 
matical equations have been published elsewhere. 5 

Collector contacts which appear to function by the hook mechanism 
just discussed have been produced by W. G. Pfann, A. E. Anderson, R. M. 
Ryder, R. L. Wallace, and L. Valdes at Bell Telephone Laboratories. 

Values of a as high as 20 or more have been obtained by this means. 

From the previous discussion of the simple space charge case, a limiting 
value of 1 + b was set for a;. If the value of b could be increased, it is 
evident that the hole space charge would be relatively increased and a 
would be raised. One possible process which might have this effect is that 
of "hole trapping." If a concentration of centers which had the property 
of binding holes tightly could be produced directly in front of the collector, 
then the holes would tend to accumulate there with a resultant increase 
in space charge. If we suppose that each hole spends, on the average, a 
fraction x of its time in these traps and a fraction 1 — x of its time moving 
with normal mobility, then the net effect will be to reduce its mobility 
by a factor 1 — x and to raise the effective value of b to b eS = b/{\. — x). 
If (1 — x) is small, this process will lead to very large values of b e g and 
hence to large possible values of a. 

Of the various mechanisms proposed for high values of a, it appears that 
the p-n hook is in best agreement with the facts and that considerations 

5 W. Shockley, Bell Syst. Tech. J. 28, 435-489 (1949). 
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based on this idea have good promise of leading to advantageously de- 
signed units. 

4.6. Phototransistors and Counters. The phenomenon of current multi- 
plication at a collector and the phenomenon of optical generation of holes 
and electrons have been combined by J. N. Shive in the phototransistor. 1 
The phototransistor is somewhat similar to the coaxial transistor, 2,3 which 
consists of a thin wafer of germanium with emitter and collector terminals 
on opposite sides. In the phototransistor, the emitter is lacking and holes 
are generated in the w-type germanium by light absorption. These holes 
then flow to the collector and produce a multiplied photocurrent. Thus 
if each photon produces one hole-electron pair and the hole in reaching 
the collector provokes the emission of two electrons, a* = 3, then three 
electronic units of charge will be produced per photon absorbed. The 
ratio of number of electronic charges produced per photon absorbed is 
called the quantum efficiency; for the phototransistor this efficiency can be 
greater than 100% because of the,**; of the collector. 

The quantum efficiency of the light absorption process itself has been 
studied by Goucher. 4 In his experiment the change in conductance of an 
K-type germanium filament due to illumination was measured. This 
change should be proportional to the total number of holes present, the 
formula being given in problem 3 of Chapter 3. The number of holes 
present is equal to the rate of generation times the lifetime, as may readily 
be established by the methods of Section 12.6. The rate of generation is 
equal to the quantum efficiency times the rate of photon absorption, which 
is determined from the energy and wave length of the light and the optical 
constants of the germanium. In Goucher's experiments the lifetime was 
measured by hole injection techniques like those discussed in Sections 
3. Id and 12.6. By comparing the change in conductance with the rate of 
photon absorption, Goucher concludes that over a range of about 1.0 to 
1.8 ix each photon absorbed produces one hole-electron pair. 

Collector junctions have also been used by K. G. McKay 5 to detect 
holes produced in the germanium by alpha particles. He finds that the 
sensitive region has a diameter of between 10~ 3 and 10 -2 cm and that the 
maximum pulse height corresponds to the passage of 10 6 electrons. The 
pulse width was less than 0.05 microseconds. McKay has also suggested 
the use of large area p-n junctions for counters and such effects have been 
observed by Orman, Fan, Goldsmith and Lark-Horovitz. 6 

1 J. N. Shive, Phys. Rev., 76, 575 (1949). 

2 J. N. Shive, Phys. Rev., 75, 689-690 (1949). 

3 R. L. Wallace and W. E. Kock, Elect. Engg., 68, 222-223 (1949). 

4 F. S. Goucher, Phys. Rev., 78, 646 (1950). 
6 K. G. McKay, Phys. Rev., 76, 1537 (1949). 

6 C. Orman, H. Y. Fan, G. J. Goldsmith and K. Lark-Horovitz, Phys. Rev., 78, 646 (1950). 
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p-n junctions themselves are quite effective photocells. 7 It is not 
necessary for the hole-electron pairs to be generated in the junction itself 
to produce a current but merely near enough so that there is a good proba- 
bility of the "injected" carriers diffusing to the junction. This probability 
is determined by the diffusion length discussed in Section 12.5. 

Problems 

1. Assume that the probability that an injected hole recombines in time 
dt is dt/r p . Show that the fraction of the holes injected at time /' which 
are uncombined at time t is exp (/' - t)/r p . (See Section 8.3 for a similar 
analysis of physically different problems.) 

2. Suppose that in a filamentary transistor the emitter current is i ( (t) 
and consists of a fraction y of holes. If the transit time is r ( , show that the 
number of holes in the filament is 

Ptot. = (y/e) f ».('') exp [(/' - t)/r p ]dt' 

and hence, using problem 3 of Chapter 3, that the increase in current is 

&"«(') = (Vj ~ PcfrG*. + i*p)L~~ 2 f ».('') exp [(/' - t)/r v ]dt' 

Ui—Tt 

= 7(1 + V) f i t {t') exp [(/' - t)/r p }dt'/T t . 

Show that this reduces to equations (18) and (19) of Section 4.1 for 
hi c {t)/i t {t) = a e = t(1 + b)P when i t {t) = i f0 exp tut. 

3. A p-n junction has a resistance of R = 400 ohms for low voltages and 
an area of A = 0.5 cm 2 . Assuming that the current is carried chiefly by 
holes and using (1) of Section 4.2, show that 

{e/kT)I ps A = \/R 

so that 

I ps = kT/cAR = \/?>9AR = 1.28 X 10" 4 amperes/cm 2 . 

Assuming that the lifetime of a hole in the w-region is 100 microseconds, 
show from (3) of Section 4.2 that this leads to 

Pn = (D p T p )*/ARe» p = 1.2 X 10 12 cm" 3 

and that this concentration corresponds to a conductivity for holes of 
3.3 X 10~ 4 ohm -1 cm -1 . This value is much smaller than 2.1 X 10 -2 

7 See footnote 1 of Section 4.2. 
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ohm -1 cm -1 corresponding to the conductivity of pure germanium at room 
temperature and shows that the high resistance of a p-n junction arises 
from the difficulty of forcing hole current to pass through »-type material 
where there are very few holes to carry it. This subject is discussed more 
fully in Chapter 12. 
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CHAPTER 5 

QUANTUM STATES, ENERGY BANDS, AND 
BRILLOUIN ZONES 



5.1 INTRODUCTION TO CHAPTERS 5, 6, AND 7 

In non-quantum-mechanical treatments of conductivity, the current 
carriers are thought of as small particles moving in random thermal mo- 
tions on which is superimposed the effect of electric and magnetic fields. 
Except for their smallness, the particles are imagined to have much the 
same attributes as larger solid bodies such as are encountered in ordinary 
experience, golf balls, for example. Although non-quantum-mechanical 
theories have failed to account for many aspects of the behavior of elec- 
trons, quantum mechanics, which includes wave mechanics, has succeeded. 
Quantum mechanics brings with it a set of new concepts which include the 
"wave function", "probability density", "energy level", "spin", "quantum 
state", and "Pauli exclusion principle" — concepts which are based on the 
development of the mathematical 'postulates of the theory. 1 In general, 
a mastery of this mathematical machinery is achieved only in a post- 
graduate course in physics. Fortunately, the results of the application of 
quantum mechanics to the motions of electrons in crystals do not need to 
be described in terms of the mathematical machinery itself but can be 
presented, instead, largely in pictorial terms. It is the purpose of Chapters 
5, 6, and 7 to develop this picture. 

Before encountering the details of the argument, the reader will benefit 
from a preview of the end result to be reached, together with some idea of 
the route to be followed. There is, in the author's opinion, a definite value 
in traversing a route from which one may view, at least superficially, the 
detailed processes used in developing a theory. The alternative would be 
to skip these intermediate steps and to present, at once, the final physical 
picture of the properties of holes and electrons as they emerge from the 
theory. This alternative seems undesirable for several reasons. It gives 
no idea of the fundamental concepts on which the behaviors of holes are 
based and thus opens the possibility of error due to applying the picture 
beyond its range of validity. Furthermore, from the purely scientific 
viewpoint, a major theoretical triumph is achieved by showing how the 

1 Chapter 14 is an introduction to some of these ideas. It follows an unconventional course 
And stresses analogies with transmission line theory. 
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electrons, through the mechanism of hole conduction, can simulate the 
behavior of positive particles. 

The objective aimed at in this chapter and the next is a description of the 
behavior of an electron in an ideally perfect crystal, in which the perfection 
of the atomic arrangement is undistorted by thermal vibrations, impurity 
atoms, or other causes. The electron is found to have certain permitted 
modes of motion, quantum states. In these, it has certain energies and 
velocities. A consequence of Schroedinger's wave equation is, however, 
that the classical relationship S = mv 2 /2 does not apply to electrons in a 
crystal; and a further consequence is that the momentum is not equal to 
mv. A "crystal momentum", P, defined in Section 5.S in terms of the wave 
functions for electrons is introduced. As we find in Chapter 7, this vector 
quantity satisfies the equation of dP/dt = F when a force F acts on the 
electron, a relationship which justifies calling it momentum in keeping 
with Newton's second law of motion. Unlike the momentum of a free 
particle, the crystal momentum of an electron is, in a certain sense, limited 
in range. Because of the repetitive or periodic property of the potential 
field of the crystal in which the electron moves, we need not consider larger 
values of momenta than those lying in a certain region, called the Brillouin 
zone, in the three-dimensional P-space. 

The energy and velocity of an electron are determined by the value of 
P inside the Brillouin zone. The relationship is multivalued, however, so 
that for a given value of P there will be a large number of possible quantum 
states, each with its characteristic energy and velocity. The various sec- 
tions of this chapter are used to develop these ideas, the mathematical 
details being given in Part III. As a result we obtain a description of the 
behavior of an electron in the crystal in terms of energy, velocity, and 
momentum which are not related in the same way as they are in classical 
mechanics. 

As an aid to the reader in following or reviewing the argument, Figures 
5.1, 6.1, and 6.3 have been prepared. These indicate the logical sequence 
of the argument, and, where the steps involved have been presented in the 
text, reference numbers are given. These three figures represent the 
development of the Brillouin zone concepts of electron motion in a perfect 
crystal 2 under the condition for which no external forces act. 

When an external force acts on the electron, then, as a consequence of 
the wave equation, it is found that the crystal momentum changes accord- 
ing to the law 3 dP/dt = P = F. For the analysis of the conductivity and 
Hall effect, forces due to electric and magnetic fields must be considered. 
The arguments used in deriving the basic acceleration law are shown in 

2 This requirement of perfection excludes thermal vibrations or impurity atoms and thus 
eliminates the random processes which give rise to resistance. 
s We shall occasionally use the symbol "•" for "d/dt" or "d/dt". 
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Figure 7.1 together with some of the consequences of this law. This 
material is discussed in Chapter 7, which deals with the dynamics of 
electrons in a perfect lattice. 

The relationships of energy d>, velocity v, and momentum P and the law 
of force P = F lead to a determination of the effects of applied fields on the 
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Fig. 5-1 — The Brillouin Zone Method of Analyzing Bloch Wave Functions for 

Electrons in Crystals. 

motion of an electron. The situation is quite like that for the classical 
case except for the different relationships between tb, v, and P. These 
differences are essential in explaining the processes in electrical conductors. 
On the basis of the new relationships between &, v, and P, the behaviors of 
an excess electron and of a hole are analyzed. The results justify the 
description of holes and electrons as classical particles given in Chapter 1. 
This is the principal aim of Chapters 5, 6, and 7. 

In later chapters, the effect of random processes which introduce electrical 
resistance and establish thermal equilibrium is discussed. These processes 
are caused by disturbances that make the crystal imperfect and are thus 
automatically excluded from Chapters 5 to 7, which are restricted to a 
discussion of a perfect crystal in which each atom is free from thermal 
motion and is exactly located at its prober lattice point. 
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5.2 QUANTUM STATES, SPIN, AND THE 
PAULI EXCLUSION PRINCIPLE 

5.2a. The Nature of the Solutions of Wave Equations. An electron, as 
is now generally accepted, is a very small particle whose laws of motion are 
governed by Schroedinger's wave equation. 1 Schroedinger's equation 
combined with the Pauli exclusion principle furnishes a satisfactory basis 
for explaining most of the phenomena occurring in solid state physics and 
chemistry. We shall discuss how its application leads to the theory of 
energy bands, Bloch wave functions, and Brillouin zones. But first, as a 
necessary introduction, we shall comment briefly on the quantum theory 
for one-electron systems. 2 

When an electron moves in three-dimensional space (with its position 
described by coordinates x, y, z), under the influence of forces which can be 
represented by a potential energy G U(^,J', z), its possible energies S (not a 
function of x,y, z) and modes of motion are described by Schroedinger's 
wave equation: 

where h = Planck's constant; 

m = the mass of the electron; 

5 = the energy of the electron ; 

t|r = \\i(x, y, z), an unknown wave function. 

We shall discuss the meaning of the wave function \|r below after we have 
compared this equation to several other equations of mathematical phys- 
ics. 3 Thus, the wave equation for the scalar or vector potential of an 
electromagnetic wave oscillating with frequency v is 

4(5* + •* + £) + ,* -a (2) 

4t 2 \dx 2 dy 2 dz 2 J 
In Figure 5.2 (f) to (i) we represent a stretched membrane (drum head) 

1 This equation is known to be inexact but it applies well so long as the speed of the electron 
is small compared to the velocity of light. It stands in relation to the Dirac equation as 
Newton's laws of motion do with respect to Einstein's relativistic equations. 

2 This treatment follows in an abbreviated form one presented by the author in "The 
Quantum Physics of Solids I", Bell Syst. Tech. J. 28, 645-743 (1939), a publication which is 
available on request to the Publication Department of Bell Telephone Laboratories as Mono- 
graph B-1184. Most of the material originally planned for II of the series, which was never 
written owing to the interruption of the war, has been incorporated in the present book. ^ 

3 A more detailed discussion of the interpretation and solution of Schroedinger's equation, 
together with a comparison between it and an electrical transmission line, is given in Chapter 
14. 
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Fig. 5-2 — Wave Functions of an Electron With Some Mechanical and Electrical 
Analogues, [(f) to (i) are from "Vibration and Sound", by P. M. Morse, McGraw- 
Hill, New York, 1937, Courtesy of the McGraw-Hill Book Co.] 
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whose vertical displacement <j> satisfies the wave equation 

77" -77 + 77 ) + " 2 4> = (3) 

4x 2 <r \6V dy 2 / 

where a is the mass per unit area, T the surface tension, and v is the fre- 
quency. This equation has solutions, subject to the boundary condition 
that <j> = on the rectangular boundary where the membrane is clamped, 
only for certain frequencies called the "normal modes" or "natural fre- 
quencies" of the drum head. These normal modes can be described by a 
set of "quantum numbers" n x and n y as shown in Figure 5.2. There is 
some choice as to the exact convention most suitable for the quantum 
numbers, and the ones selected for (f) to (i) have been chosen to conform 
to those later used for Brillouin zones. These quantum numbers are 
defined as the number of wave lengths of the vibration in each of the two 
directions so that each is equal to \ + ^ (number of nodal lines, heavy 
arrows, in the figure). 

Another system, whose normal modes may be familiar to the reader, is 
the length of transmission line with choke coils across its ends (j). (This 
structure and the simple harmonic oscillator of (d) and (e) are introduced 
for use later in describing what happens when isolated atoms are brought 
together to form a crystal.) 

Part (a) shows the potential energy function ^ for an electron in a 
hydrogen atom. It is defined so that the energy is zero when the electron is 
infinitely far away from the proton; consequently, ^ = — e 2 /r where e is 
the electronic charge and r the distance from the proton. When this 
potential energy is put in Schroedinger's equation, solutions for v|/ which 
satisfy the proper boundary conditions (\J/ = when r approaches infinity 
in this case) can be obtained only for certain values of £>. These solutions 
are called "eigenfunctions" or "proper functions", and the energies are 
called "eigenvalues" or "proper energies". It is found that these wave 
functions, two of which are shown in (b) and (c), can be described by a set 
of quantum numbers like those for the rectangular drum head. For ex- 
ample, the 1 in U and 2 in 2s are each just one more than the number of 
times the value of i|/ crosses zero as a function of r. 

A feature to be stressed about Schroedinger's equation is that the 
quantum numbers are not introduced in it as a special assumption. 
They arise in just the same way as those shown with the rectangular drum 

4 There may be some choice about the numerical values assigned to the quantum numbers. 
Thus in Figure 5.2, n x and n y might have been chosen to be the number of half wave lengths 
: along each edge, so as to avoid half integers. However, the point we wish to make is that no 
special assumption is made in order to introduce quantum numbers in the first place. This 
was not true of the old Bohr theory; in it quantum numbers were introduced as a special 
assumption. 
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head of Figure 5.2 or the normal modes of a resonant cavity. The mathe- 
matical aspects of all of these problems are very similar and all involve 
solving a differential equation subject to certain boundary conditions. 
When the proper wave functions are examined in detail, it is usually found 
that they are characterized by having a certain number of wave lengths or 
half wave lengths in the x, y, and 2 directions (or in three other directions 
depending on the boundary conditions). These numbers of half wave 
lengths are thus naturally arising quantum numbers which can be used to 
characterize the solution of the wave equation. For an electron in a 
hydrogen atom, the most suitable quantum numbers are simply related to 
the number of half wave lengths (or nodes in the wave function) in the r, 
6, and tp directions in spherical coordinates. For our purposes, however, 
we do not need to refer to the detailed meaning of quantum numbers of 
electrons in atoms. 5 We shall be, however, much concerned with the 
quantum numbers used to describe the motions of electrons in crystals 
and will give these an interpretation in terms of the wave lengths of the 
wave functions involved (and thereby introduce the crystal momentum P). 

In connection with the motion of electrons through crystals, we shall 
later have occasion to compare the behavior of electron wave functions 
with those of electromagnetic waves or mechanical waves. For this reason 
the interpretation of the wave functions in both cases should also be 
compared. 

According to quantum mechanics, the wave function \|; is a "probability 
amplitude" (a phrase developed in quantum mechanics and not found else- 
where), and the square of its absolute value, denoted by |i|r| 2 = |\|/ | = 
i|»*i|f, is a probability density. 6 The probability density has the following 
interpretation: the electron in its wave function is to be thought of as a 
very small particle with extent negligible compared to the size of the atom. 
It itself is not spread out over the volume occupied by the wave function. 
However, in its motion it traverses the space about the atom spending 
varying amounts of time in each region. The probability density at any 
point is simply proportional to the amount of time the electron spends (per 
unit volume) in a small volume about the point. In fact, the scale factor 
of the wave function is usually so adjusted that |i(r|Vr is simply the 
traction of time the electron spends in the volume dV. As a consequence, 
W*dV is the probability of finding the electron in the volume dV. (\|r 

5 An elementary discussion of this topic is given in the Monograph B-l 184. previously 
referred to. 

6 As discussed in Chapter 14, + itself may be regarded as a mathematical aid used in 
applying quantum mechanics to explain the behavior of electrons. There is even less purpose 
served in discussing what <|/ is a displacement of than in discussing what kind of an ether is 
being stressed by an electromagnetic wave. For any set of values of x, y, z, and /, + is a 
complex number which can be written as u + iw where u and w are real numbers. |+] = the 
positive square root of u 2 + w 2 and +*+ = u 2 + w 2 . 
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itself has thus the dimensions of cm - ^.) When these probabilities are 
added up for all space, the sum is unity since the electron must be some- 
where. A wave function which adds up to unity in this way is said to be 
normalized. 

Consequently, |\|/ 2 | gives the long-time-average charge-density distri- 
bution produced by the electron. It is such average-charge densities that 
are represented in Figures 1.1, 1.2, 1.4, and 1.14. (These take the place of 
the Bohr orbits of the old quantum theory.) For many purposes, the 
electron moves so quickly over its possible positions that this average 
itself can be used. For other cases, such as for van der Waals' forces, it is 
necessary to take into account the fluctuations in position of the electron. 
Other aspects of the electron's behavior, such as its average velocity, can 
be obtained from the wave function in ways discussed in Chapter 6 and 
Chapter 14. 

The probability density behaves in a way closely analogous to the energy 
density of electromagnetic or mechanical vibrating systems, provided that 
these have no loss, so that the energy of oscillation is conserved. For 
these cases the voltages and currents (or electric and magnetic fields) and 
the displacement and velocity are the amplitudes involved. The energy 
densities are proportional to the squares of these amplitudes. There is an 
important difference, however: In the electromagnetic or mechanical cases, 
the solutions are usually obtained in complex form, and, for these cases, the 
real part and the imaginary part of the solution are each individually 
equally meaningful (in the physical rather than mathematical sense) as 
solutions of the problem. Each of these solutions shows a transfer back 
and forth, twice each cycle, of electric to magnetic energy or of kinetic 
to potential energy. For the quantum mechanical case, on the other 
hand, the wave function is complex, its time dependence is of the form 
ex p (_ 2iri&t/h) where & is the energy, / is time, and h is Planck's constant, 
and neither the real nor imaginary part of v|/ is alone a solution of Schroe- 
dinger's equation. Only the complex wave function itself is. The value 
of |\|r 2 | is thus independent of time and represents an unvarying probability 
distribution. 7 This is a natural result, since unlike energy density, which 
can exist in two forms, both for the electromagnetic and for the mechanical 
case, there is only one form of probability density so that the possibility of 
exchanging between one form and another is excluded. (There is, how- 
ever, an exchange between the real and imaginary parts of the wave func- 
tion \|/, which has a bearing on the wave velocities discussed later.) 

In the cases of classical waves and in the quantum mechanical case as 

7 While |<|» 2 | is always independent of time for the normal mode solutions or eigenfunction 
solutions of Schroedinger's equation, there are other solutions in which |+ 2 | varies with time 
and the probability distribution moves in space. Such cases are described in connection with 
wave-packets, Section 6.3. 



5.2a] 



QUANTUM NUMBERS 



127 



well, it is sometimes of value to get a new wave function by adding together 
other wave functions. This addition is permitted because of the super- 
position principle which arises from the linearity of the differential equa- 
tions involved. Considerations of wave functions built up by superposi- 
tion are particularly helpful in understanding the bodily motion of the 
electron through empty space or through a crystal and we shall return to 
them in Section 6.3. 

The important feature about the normal modes of any system, such as 
the drum head, is that any free oscillation for that system can be expressed 
in terms of them. Thus if the drum head is struck a sharp blow and then 
allowed to oscillate, its motion will not be represented by a single one of 
the normal modes of Figure 5.2 but instead by a combination of the effects 
of several normal modes, each oscillating with its own proper frequency, 
independently of the others, the total displacement being given by the sum 
of the component displacements. A disturbance of this sort does not have 
a single frequency and satisfies a wave equation containing a derivative 
with respect to time: 



a \dx 2 



+ By 2 



)~ 



dt 2 



0. 



(4) 



The mathematical expression for the solution of this equation is obtained 
by adding together the solutions for the normal modes as follows: If 
$i( x > y)> <t>2 (•*"» jy ) " • • are normal mode solutions with frequencies/i, /2, • • •, 
then a general solution of the time dependent equation is 

$(x, y, t) = Ai$i(x,y) sin 2w/i(/ - h) 

+ A 2 $ 2 (x,y)sm2wf 2 (t- t 2 ) -\ (5) 

where A\, A 2 , • • • and /i, t 2 • ■ • are constants. For each of these normal 
mode terms individually, the time-dependent equation reduces to the one 
containing/ 2 and no time derivatives. 

The corresponding time-dependent Schroedinger equation is 
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2x dt 



= 0. 



(6) 



It can also be solved by adding together eigenfunction solutions, each 
multiplied by a time-dependent term. 8 We shall have occasion to con- 
sider wave functions constructed in this way in connection with wave- 
packets, which represent the motion of the electron through the crystal, in 
Chapter 6. In this chapter, however, we shall deal only with eigenfunc- 
tion solutions. 

'Such time-dependent terms are considered in Chapter 14 and are also used in Equa- 
tions (2) and (5) of Section 5.4. 
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5.2b. Spin and Quantum States. In addition to its motion through 
space, the electron also rotates about its own axis. This rotation intro- 
duces effectively a fourth degree of freedom and a fourth quantum number 
■ — the "spin". For reasons associated with the theory of relativity, the 
fourth quantum number is found to have only two permitted values, 
denoted by +| and — \. In considering the cohesive forces between 
atoms certain "exchange" effects associated with the spin are of great 
importance. The spin of the electron also produces the magnetic moment 
which is responsible for ferromagnetism. For the purpose of treating the 
motions of electrons in semiconductors, however, we are concerned only 
with the fact that the spin gives the electron an extra degree of freedom so 
that it may move in accordance with a given wave function either with 
plus spin or with minus spin. 

We are now in a position to introduce the concept of a "quantum state". 

A quantum state describes a possible mode of behavior of an electron. It is 
specified by stating its four quantum numbers, three describing the wave func- 
tion and the other the spin. 

So far our discussion has been concerned with the behavior of a single 
electron. When several electrons are simultaneously present and exert 
forces of electrostatic repulsion on each other, it is no longer exact to think 
of each electron as moving in a constant force field, and, consequently, it 
is not exact to speak of quantum states for individual electrons but instead 
only of the quantum state for the system as a whole. However, when 
large numbers of electrons are present, certain approximate methods may 
be used. According to these, when one electron is being considered, the 
effect of the others may be averaged to give a force field which is then taken 
as fixed and definite. The motion of the electron in this average force 
field is then investigated, and its quantum state is determined. The same 
process is carried out for all the other electrons also. (This process is 
usually repeated until a "self-consistent" solution is obtained: that is, 
starting from a set of trial wave functions for the electrons, the force field 
on each individual electron due to the other electrons is worked out. From 
these force fields a new set of wave functions is found. This process is 
repeated until a set of wave functions is obtained which produces a force 
field which gives the same set of wave functions back again.) To the de- 
gree of approximation used in the theory of semiconductors, this averaging 
of the force field is accepted as satisfactory, and the behavior of each elec- 
tron is described by specifying the quantum state it occupies. 

5.2c. Pauli Exclusion Principle. In terms of the quantum state concept 
we may now state the "Pauli exclusion principle" which is a foundation 
stone in the theory of matter. 

The Pauli exclusion principle states simply that no two electrons may occupy 
one and the same quantum state. 
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If it were not for the existence of a law of nature of this form, all of the 
electrons in complex atoms would drop to the lowest energy level and the 
structure of the periodic table, the diversity of chemistry, and the reader of 
this book would not exist. 

In applying the Pauli exclusion principle, care must be taken to dis- 
tinguish properly between quantum states. Thus the exclusion principle 
does not require that, if one electron is in the lowest energy quantum state 
in a particular hydrogen atom, then no other electron in the universe can 
be in the same quantum state on another hydrogen atom. In fact, in a 
gas of hydrogen atoms, we should expect half the electrons to be in the 
lowest quantum state with one spin and the other half in the state of 
opposite spin. This apparent paradox is resolved by noting that, when 
we have two atoms, the quantum state scheme is essentially doubled. In 
fact the wave functions around two nuclei require a new set of quantum 
numbers. It is found, however, that, if the atoms are far apart, the wave 
functions themselves are simply what they were for one atom and are ob- 
tained twice over, once centered about one atom and next about the other. 
In other words, two separate atoms do not have the same quantum states; 
each of them has wave functions of exactly the same shape which are 
located at different points in space and correspond to distinct quantum 
states, one set for each atom. 

The mathematical formulation of the Pauli exclusion principle is ex- 
pressed by requiring that the wave function representing the system of 
electrons be "antisymmetrical". The meaning of this term and various 
other mathematical aspects of the Pauli exclusion principle are discussed 
in Section 15.7. It is not necessary, however, to study these analytical 
features in order to understand how the principle is applied in this and 
the following chapters. 



5.3 QUANTUM STATES AND ENERGY BANDS IN CRYSTALS 

There are two principal procedures for introducing the systems of quan- 
tum states in a solid. In one of these the average field method discussed 
in the last section is used to determine the individual motions of the elec- 
trons in the solid. It is necessary to use this method in describing the 
momentum, acceleration, and velocity of charge carriers in semiconductors. 
The other procedure traces the quantum states in the solid back to their 
origin in the isolated atoms of which the solid is composed. We shall treat 
the latter method first. 1 

Accordingly, we imagine that the N s atoms of the solid are arranged 
perfectly on the crystal lattice in question but that they are separated by 

1 This treatment follows closely, in some respects, that of W. Shockley, "The Quantum 
Physics of Solids I", Bell Syst. Tech. J. 17, 645 (1939), Monograph B-1184. 
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p IG 5-3—A One-dimensional Crystal and Some Mechanical and Electrical Ana 
logues. [An electron, because of the wave mechanical laws that govern it, can move 
through a region where its potential energy is. greater than its total energy, as shown 
in (a). In this region its wave function is damped like an electromagnetic field in a 
wave guide when the frequency is below cutoff.] 
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many times the normal lattice constant so that there is substantially no 
interaction between them. 2 The quantum state distribution for the 
crystal is then essentially that of one atom duplicated N s times. That is, 
each energy level corresponding to an atomic quantum state occurs N s 
times, once for the corresponding quantum state on each atom. However, 
as the lattice constant is reduced the situation changes and, when the atoms 
are so close together that appreciable overlapping of the wave functions 
between adjacent atoms occurs, the quantum state and energy level schemes 
are profoundly modified. The result of this modification is shown pic- 
torially in Figure 5.3. In parts (h) to (n) of the figure, the effect of inter- 
action is illustrated in terms of a set of one-dimensional mechanical oscil- 
lators. / The interaction between atoms is represented here by weak springs 
coupling the oscillators together. When the coupling is zero, each oscil- 
lator has its own normal mode of vibration, all of the same frequency. As 
soon as coupling occurs, the vibrations can no longer be restricted to one 
oscillator but extend throughout the system. However, if the coupling is 
weak, each of the modes (i) to (n) will have almost the same frequency as 
an isolated oscillator. As the coupling becomes stronger, the band of 
frequencies spreads over a wider range. 

A very similar behavior occurs for the atoms. As soon as the wave 
functions overlap, the quantum states are no longer restricted to single 
atoms but instead extend over the entire crystal as is suggested in (b) to (g) 
of Figure 5.3. The energy levels corresponding to the quantum states split 
in much the same way as do the frequencies of vibration for the system of 
oscillators. In this way each atomic energy level leads to a band of energy 
levels or energy band in the crystal. This is illustrated in Figure 5.4 which 
shows qualitatively the results predicted from Figure 5.3. 

In the process of splitting, the total number of degrees of freedom of the 
mechanicaToscuTators and the number of quantum states for the atoms are 
invariant. Thus, for example, if the six atoms of Figure 5.3 contained 
twelve electrons, so that the lowest quantum state of each spin was oc- 
cupied, then there would still be jaisi^nough quantum states to accommo- 
date all .the electrons in the band of energy levels produced by the interac- 
tion : there would be six wave functions each giving rise, because of spin, 
^ 

to two quantum states. 

For systems of larger numbers of atoms, there will be more wave func- 
tions. However, the one of lowest energy will have a slowly varying 
modulation wave like that of the dashed line in Figure 5.3(b) and the one 

2 The crystal lattice describes the relative arrangement of the atoms. The diamond 
structure of Figure 1.3 is common to carbon, silicon, germanium, and one modification of tin. 
Ordinarily by "lattice constant" one means the value observed in nature. In this chapter, 
however, we use "lattice constant" as a variable and "normal lattice constant" for the one 
observed in nature. 
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of highest energy will be simply a continuation of (g) to more atoms. 
Hence the upper and lower energies for the electron wave functions, or 
frequencies for the normal modes of the analogues, will be independent of 
the length of the structure, provided that it contains a large number of 
units. The conclusions just reached can be summarized in two basic 
theorems for energy bands: 




is 

QUANTUM 
STATE 



LATTICE CONSTANT *■ 

{** STRENGTH OF COUPLING) 

FlG 5-4-Dependence of Energy Levels upon Lattice Constant, or Frequency of 
Vibration upon Strength of Coupling. 

the band width theorem: The width of the energy band arising from an 
atomic energy level will be independent of the number of atoms tn the crystal. 

THE THEOREM OF THE CONSERVATION OF QUANTUM STATES: The number 

of quantum states in the energy band will be the same as the number of atomtc 
quantum states from which the band was produced. 

There are no quantum states with energies between those shown in the 
energy bands. For this reason the energy range running from the top of 

'We shall be intimately concerned with waves like (g) for which the wave length X is 
twice the period - of the structure, in the remainder of this chapter. Such waves correspond 
to the edges of the Brillouin zone. . , , 

Mf we imagine that the lattice constant can be varied as shown m F.gure 5.4, then the 
width of the band depends on the lattice constant. It also depends upon the energy level 
from which the band aross. 
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one band to the bottom of the next is referred to as a. forbidden energy band 
or energy gap. The energy bands containing quantum states are referred 
to as allowed energy bands; although, where confusion is unlikely to arise, 
they are usually referred to simply as "energy bands", as was done in the 
two theorems. 
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Fig. 5-5 — Energy Bands for Diamond versus Lattice Constant. 

* The unit in this figure is the Rydberg = 13.5 electron volts = | ths atomic unit of 
Appendix A. 

For the case of carbon, the important energy bands arise from the 
quantum states occupied by the valence electrons. These are denoted as 
the Is and 1p states and contain a total of eight quantum states per atom. 
They are, therefore, half occupied in the isolated atoms. Each atom also 
contains two electrons in the Is state. However, these electrons have 
their wave functions so compactly bound to the nucleus that no appreciable 
overlapping between adjacent atoms occurs. They can be considered, 
therefore, as unaffected by their being in the crystal. The energy bands 
for carbon 5 in the diamond structure are shown in Figure 5.5. At large 
lattice constants, the lower band is filled and the upper partially filled. At 
smaller distances, however, the eight states per atom are split equally 
between the two bands so that the lower band is completely filled. This 
lower band corresponds to electrons in the valence-bond structure. To 

*G. E. Kimball, /. Chetn. Phys. 3, 560 (1935). 
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excite an electron requires enough energy to raise it to a quantum state in 
the upper band. It is to be noted that the average energy of an electron 
in the lower band at the actual lattice constant is much less than for the 
free atoms. This drop in energy is associated with the "binding energy" 
of the crystal. A comparison of the total energy of a set of isolated 
carbon atoms with that of the same atoms arranged in the crystal shows 
that the loss in energy of the valence electrons makes the latter much more 
stable. The energy required to pull the crystal apart into free atoms is 
called the binding energy. For a valence crystal, like diamond, the 
approximations employed in the energy band method of calculation are 
probably less accurate than those dealing directly with the electron-pair 
bond so far as binding energy is concerned. However, our main interest 
in quantum-mechanical theory centers about the problems of electrical 
conductivity rather than those of cohesion, and for conductivity the 
energy band theory is not only the best approximation currently available 
but apparently an adequate one as well. 

5.4 BLOCH WAVE FUNCTIONS IN A PERIODIC 
POTENTIAL FIELD 

In the last section, we concluded that the energy levels of an array of 
isolated atoms are split into energy bands when the atoms are brought close 
together to form a crystal. By analogy with the behaviors of classical 
systems, the theorems on band width and conservation of quantum states 
were developed. We must next consider the wave functions obtained from 
the viewpoint of their nature as solutions of the wave equation for an 
electron moving in a periodic potential. This will form a basis for later 
discussions of the velocity and acceleration of the electrons. 

The solutions obtained for Schroedinger's equation for an electron in a 
crystal depend upon three attributes of the problem considered: 

(1) The form of the potential field in which the electron moves: that is, 
the value of %(*, J', z) at each point in one unit cell of the crystal. 
(Since the crystal is periodic the value will be the same in every 
unit cell.) 

(2) The size of the crystal. 

(3) The boundary conditions at the surface of the crystal. 

The nature of the problem, as discussed in Section 5.2, leads to the intro- 
duction of a set of quantum members which can be used to specify the wave 
functions and the associated characteristic energies. 

In order to deal with running waves which carry current through the 
crystal, we shall consider a type of boundary condition different from that 
implicitly introduced in connection with Figure 5.3. We shall see below 
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that this change in boundary conditions does not affect the theorem of the 
conservation of quantum states. 

The boundary condition shown in Figure 5.3 is one for which the wave 
function vanishes at the ends of the one-dimensional crystal. We shall 
refer to this case as "vanishing" boundary conditions. These boundary 
conditions are mathematically equivalent to setting up infinite force fields 
just outside the crystal which reflect the electron inwards. Under these 
conditions the electron wave is a standing wave which can be regarded as 
made up of two running waves. However, each of these running waves 
individually, although it does satisfy Schroedinger's equation for the 
particular potential G U, is not a solution which satisfies the boundary condi- 
tions. The normal modes for the coupled oscillators similarly satisfy 
boundary conditions for which the end points of the half-coupling springs 
at the ends of the row have no displacement; the dashed modulating waves 
show this behavior by having nodes at the end. For these boundary condi- 
tions, the electron waves (b) to (g) do not carry any probability density 
along the crystal nor do the mechanical oscillations (i) to (n) carry any 
mechanical energy. In order to deal with eigenfunctions or normal modes 
which support a flow, it is most convenient to use periodic boundary 
conditions. 

Periodic or cyclic boundary conditions amount to doubling the crystal 
back on itself or arranging the coupled oscillators in a closed loop. Such 
an arrangement is shown in Figure 5.6. The six oscillators are now mounted 
below a rigid ring and are coupled together as for Figure 5.3. Some means 
of constraint is assumed which restricts their motion to the vertical axis. 
The lowest frequency of oscillation is that shown in (a). For this ca B <:, all 
the bobs move together and the coupling springs are inactive. In (b) the 
motion is given by a sine wave function which has one wave length around 
the loop. In this case, the coupling springs between (2) and its neighbors 
(1) and (3) exert a restoring force on (2) and cause the frequency of the 
normal mode of (b) to be higher than that of (a). [In order to simplify 
the diagram, the coupling springs have been omitted from the figure except 
in (a).] There are two normal modes for (b), one for each direction of 
travel of the sine wave. The frequency of these waves would be the same 
as (j) in Figure 5.3; in fact, the standing wave (j) can be obtained by 
allowing the right- and left-hand waves of Figure 5.6(b) to interfere. 
These two waves are given the quantum numbers n x = +1 and n x = — 1: 
+ for the wave running towards -\-x and — for the wave running towards 
—x, and 1 because the wave has one wave length around the crystal. In 
accordance with this convention, we have the relationship ±1/X = n x /A x 
= n x /N x a, which we shall use later. 

Part (c) of Figure 5.6 shows a wave with two wave lengths around the 
loop, n x = ±2. There are again two normal modes, with waves running 
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to the left and to the right each having the same frequency as (1) of 

Figure 5.3. 

A critical condition occurs for X = la, however. For this case, both the 
right-going and left-going waves produce exactly the same motion of the 
bobs,, in this instance a very simple one in which the even-numbered bobs 
have identical harmonic oscillations just 180° out of phase with odd- 
numbered bobs. . 

We see that there are altogether six normal modes for the periodic 
boundary condition of Figure 5.6, just as there were six for the "clamped 
end" boundary condition of Figure 5.3. However, the proper frequencies 
are somewhat different in the two cases. 1 

We must next consider the solutions of Schroedinger's equation which 
are analogous to the running waves for the coupled mechanical oscillators 
just discussed. Some insight into the relationship between the two prob- 
lems may be gained by writing the equation for the displacement produced 
by a sine wave with wave length X, frequency v, velocity c (equal to Xc), 
and unit amplitude in Figure 5.6. The displacement z of the bob at 
position *, measured along the rim of the circle, may be written in any one 
of several forms: 



= sin 2ir 



= sin lirv 



rZ- V A = sin (2»A)(* - ct ) 

(H- 



(i) 



If we increase x by a, that is, go from one bob to the next, the phase angle 
in the wave is increased by 2ir«/X. This effect is equivalent to replacing / 
by / - a/c. In other words, the oscillation at one bob is just the same as 
at the previous one except that it is delayed by a time a/c, the time required 
for the wave to traverse one period of the structure. The velocity for such 
waves varies with X, an example being giyfnJn_Ch^pjter_15. 

It seems logical to expe^FHianhTsoTuTions to Schroedinger's equation 
for waves in the crystal will give the same behavior in every unit cell, 
except for a lag in phase angle corresponding to the time required for the 
wave to progress from one cell to the next. The proof that waves of this 
sort are eigenfunctions is due to F. Bloch 2 and the corresponding wave 
functions are called Bloch functions. z For a one-dimensional crystal the 
Bloch function is 



fxO,/) = u x (at) exp 



t 2 *^-*)] 



(2) 



1 Some problems bearing on these matters are given at the end of this chapter. 

2 F. Bloch, Z. Physik 52, 555 (1928). 

3 These subjects are discussed in more detail in Chapter 14. 



138 



ENERGY BANDS, AND BRILLOUIN ZONES 



[5.4 



In this U\(x) is the same function in every unit cell; it is periodic so that 

u x (* + a) = UxO). (3) 

It is thus seen that the behavior of ty\(x, t) is just the same at x + a as at x 
after a lapse of time given by 



- = v\t or t = a/X^x; 

A 



(4) 



in other words, the wave progresses to the right with phase velocity \v\. 
The periodic function Ux(^) and the frequency v\ both depend upon X as is 
indicated by the subscripts. U\(x) is, in general, complex, with a varying 
phase angle, although it becomes real for X = ». (For X = 2a, VL\(x) 
varies as exp ( — 2vix/\) so that \|/x has a special behavior as shown in 
Figure 5.7(c).) The energy &\ of the wave function also depends upon X 
and is related to the frequency v\ by the equation 4 



&\ = hv\, 



(5) 



as may be easily verified by taking the partial derivative with respect to 
time in the Schroedinger time-dependent equation. As is always true for 
energyeig^nfunctions^ the value of |\|/x(#, t)\ 2 is independent of~tTme. 
[ThisToIIowsjinalytiqairy "irom"th~e facT^h~aFthe"only time dependent term 
in (2)"is~exp ( — lirivii).] 

The remain^eFoFthis chapter and the next will be devoted chiefly to 
examining the behavior of the Bloch wave functions \|/x(-f, t) and interpret- 
ing the results. The value of X, or better that of ±A/X = hn x / A — 
hn x /N x a, will be found to be a useful "quantum number" to characterize 
thj^s£ateshi_an energy band. Of particular importance is the significance 
of conditionsITEe^fKose" shown in Figure 5.3(g) and (n) and 5.6(d) for 
which the phase difference is 180° for one period of the lattice or, in other 
words, X = la. (These conditions determine the boundary of the Brillouin 
zone discussed in the next section.) 

Figure 5.7 (p. 140) shows some running waves of the Bloch form. In 
order to give an idea of how the wave function xjr and probability density 
|\|/| 2 depend on time and distance, we have plotted the real and imaginary 
parts of \|f and also |iff| 2 in the figure. As described in connection with the 
figure, we see that each wave gives the same probability density at every 
atom. However, the real and imaginary parts of \J/ individually have the 
type of wave motion along the crystal required by equation (2) of Section 

4 This famous equation, which is now a part of the general formalism of quantum mechan- 
ics, was first discussed by Planck in 1900 in connection with thermal equilibrium of electro- 
magnetic radiation in a cavity. In 190S Einstein used it to interpret photoelectric emission 
by supposing that light quanta or photons deliver energy in units of hv to electrons in the 
photoelectric effect. 
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5.4. As we shall see in Chapter 6, this wave motion is associated with 

electron current, the electron being, in a sense, carried along by the wave. 

In terms of these running waves, the condition X = la can be given special 

significance. This is a condition for which the behavior of the running 




Fig. 5-8— Phase Relations between Reflected Waves, Showing Critical Condition 

for X = la. 



wave degenerates to that of a standing wave. Furthermore, the same 
standing wave is obtained whether waves running to the right or to the 
left are used initially. [Discussed in connection with Figure 5.6(d).] 
Because of the fundamental importance of the limiting condition X = la 
in the theory of Brillouin zones, we shall use, in addition to the discussion 
of the wave behavior of Figure 5.7, another argument leading to the same 
result. In Figure 5.8(a) we imagine that a wave / whose sinusoidal 
behavior corresponds to X = la is traversing the crystal from left to right. 
As it passes by each plane of atoms, a certain fraction of it is reflected 
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Fig. 5-7 — The Wave Function 
f = u P (x) exp (iPx/h). 

This figure shows the behavior 
of wave functions for three quan- 
tum states of the lowest energy 
band and some wave functions for 
higher energy bands. 

(a) shows a relatively long wave 
length, X = la. Its real and im- 
aginary parts (al) consist of a part 
with period a multiplied by a sinu- 
soidal part. The sinusoidal parts 
are related as sine and cosine. This 
results in the same value of |\|/| 2 = 
(Reall|/) 2 + (Imag. \|r) 2 in every 
unit cell (a4), since sin 2 + cos 2 = 1. 
Furthermore, the phase angle of 
the wave function, 6 = tan~ * (Imag. 
Xrf/Realvf/), varies linearly with x, 
(a3), and goes through 360° in 7 
lattice constants. As t increases 
from to t\, the phase angle for 
any fixed value of x changes by 
88= —2irv\ti, (a3). This isequiva- 
lent to moving the wave function 
along (SB /2t) wave lengths. The 
phase velocity is, therefore, X(50 / 
2ir) //i = X;/ x . 

(b) shows a wave with X = 
(7 /3)a, only slightly larger than 
the critical value, X = 2a. The 
scale is enlarged to show the more 
rapid variations. Again, the sinu- 
soidal waves for real and imaginary 
parts of \|/ are 90° out of phase at 
the centers of the atoms. However, 
between atom centers, the phase 
angle changes irregularly, tending 
to remain constant near each atom 
center, then changing abruptly be- 
tween atoms where the wave func- 
tion is small. If U\(x) were real 
or had a constant phase angle, the 
phase angle for i\i would vary 
linearly with (x). However, U\(x) 



(C3) 
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is, in general, complex and can be 
taken as real only for X = 0. 

(c) For X = la, the wave func- 
tion is multiplied by - 1 (a phase 
change of 180°) between one atom 
and the next. For the case shown, 
the wave function is entirely real 
(at another time it will be entirely 
imaginary due to theexp (—2iriv\t) 
time dependence). The transition 
of phase by 180° occurs discontin- 
uously at the midpoint between 
atoms where the wave function 
vanishes so that the phase is in- 
determinate. These discontinuities 
have been drawn like a left-handed 
screw in keeping with (a) and (b). 
However, exactly the same be- 
havior of v|/ would result if the dis- 
continuities were like a right- 
handed screw. Thus, the same 
behavior of the wave function for 
X = 2a results for waves running 
initially in either direction. In 
other words, this is a standing 
wave. 

(d) shows some real and im- 
aginary parts of wave functions 
for electrons in the conduction 
band for metallic sodium as com- 
puted by Slater.* We include them 
to show that numerical calcu- 
lations of wave functions of the 
Bloch type for higher energy- 
bands have been carried out. The 
heavy sine-wave lines have the 
wave length X of the Bloch waves, 
and the dashed lines show the 
plane free electron waves into 
which the Bloch wave functions 
would transform if the potential 
energy function C U were diminished 
to zero. 

* J. C. Slater, Rev. Mod. Phys. 6, 
209-280 (1934). 
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giving rise to reflected waves i?i to R A from planes 1 to 4, which have been 
arbitrarily selected for study. As is seen, each reflected wave starts 180° 
out of phase with the wave of the preceding plane. However, by the time 
it has reached the preceding plane it has lost an additional 180° and is in 
phase. Hence the waves R x to /? 4 are all in phase and do not cancel one 
another. The situation is quite different in (b), where/ has a wave length 
different from two lattice constants. Then the waves Ri to /? 4 are out of 
phase and cancel out on the average. As a consequence of this, it is pos- 
sible for the wave in (b) to traverse the crystal with only a small fraction of 
reflected wave present. However, the wave in (a) will build up a reflected 
intensity just equal to the incident intensity; the reflected waves will then 
be converted to the incident wave by additional reflections just as fast as 
the incident is converted to reflected. However, this situation of waves 
of equal intensity moving in both directions is mathematically equivalent 
to a standing wave. Hence it is impossible to transmit energy or electrons 
through periodic structures such as crystals, or coupled mechanical or 
electrical oscillators, when the wave length is twice the period of the 
structure. 

It is evident that, if we start with modulation waves of very long wave 
length and make them shorter and shorter, a natural limit is reached 
when X = 2a. In fact, this range of wave lengths just covers the states 
in the energy band. However, in order to discuss the extension of the 
ideas presented here to three dimensions, and to lay the foundation for 
discussing velocity and momentum, we must introduce a suitable means of 
describing the modulation waves in three dimensions. The quantum 
number required for this purpose plays the role of momentum for electrons 
in crystals and is introduced in the next section. 

Before discussing this next topic, the very close analogy between the 
results discussed here and certain electrical and acoustical problems should 
be mentioned. In electrical filter systems made of identical networks con- 
nected in tandem, there are frequency bands of transmission and bands of 
attenuation. The frequency at which the transition between these occurs 
corresponds to the condition X = 2a, the currents in adjacent networks 
being 180° out of phase. Between the transmitting bands the electrical 
signal decays exponentially from network to network through the filter. 
The electron waves corresponding to energies lying outside the energy 
bands are similarly attenuated and cannot be made to fit the periodic 
boundary conditions we have considered. They are of interest in connec- 
tion with the surfaces of semiconductors, however, but these are matters 
not covered in this text. The be*havior is entirely similar for acoustical 
lines having periodic structures. 5 

5 For a general treatment of such problems see L. Brillouin, Wave Propagation in Periodic 
Structures, McGraw-Hill Book Co., New York, 1946. 
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S.5 BRILLOUIN ZONES AND THE CRYSTAL MOMENTUM P 
AS A QUANTUM NUMBER 

In this section we shall introduce the crystal momentum in place of 
the wave length as a means of characterizing the Bloch wave functions. 
The crystal momentum, which we shall shortly define, has many con- 
venient properties: Used as a quantum number, its allowed values are 
described by a very simple scheme, and the critical condition X = la is 
simply represented in terms of the crystal momentum. In Chapters 6 
and 7 we shall use it in studying the velocity and acceleration of electrons. 

We shall first deal with a one-dimensional case. For this we define the 
crystal momentum P x along the #-axis as follows: 

P x = ±A/X (!) 

(+ for waves running to the right, - for waves running to the left). 
Since h has the dimensions of action, P has the dimensions of momentum 
(mass X length/time). 1 Furthermore, if the periodic potential were 
imagined to diminish so that the Bloch functions became simply plane 
electron waves in free space (which quantum mechanics interprets as 
electrons moving freely with constant momentum), then equation (1) 
would give the correct relationship between momentum and wave length. 
In the crystal, however, the electron is acted upon by forces so that it 
cannot be considered to have a definite momentum and for this reason we 
have used the phrase "crystal momentum" to indicate that P x is not the 
momentum in the ordinary sense. (We shall use capital P for crystal 
momentum, small p for ordinary momentum.) However, since P x has the 
attribute, discussed later, of changing at a rate equal to the applied force, 
it may be used as a momentum for many purposes. 

In terms of P x , the Bloch wave function takes the form 

v|/ Px (x, t) = u Px (x) exp [(2iri/h)(P x x - S P j)]. (2) 

We must next ask what values of P x are allowed. The answer to this 
question depends on the size of the crystal and the boundary conditions. 
We shall consider a crystal consisting of N x unit cells of length a and thus 
being A x = N x a long, and shall assume periodic or cyclic boundary condi- 
tions equivalent to supposing that the crystal is bent around until the first 
and last atoms become near neighbors. As a consequence of this boundary 
condition, the wave function at positions x and x + A x must be identical 
because these two values of x are really the same place. In other words, 
ty must be periodic with period A x . Since u Px {x + a) = u Px (x), we also 

1 This result may be recalled from the Heisenberg uncertainty relation, which specifies 
that the uncertainties in momentum and position satisfy the inequality ApAx > A/4ir where p 
represents ordinary momentum. 
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have VL Px (x + A x ) = u Px {x). Consequently, for \\/ Px to be periodic with 
period A x , we must have 

exp [(2«/A)P«(* + A x )] = exp [(2 v i/h)P x x], or exp [{2 v i/h)P x A x ] = 1, 
or (2Tri/h)P x A x = 2*in x (3) 

where n x is an integer and is, in fact, simply the number of wave lengths of 
v(f around the cyclic crystal (as shown in Figure 5.6). This leads to the 
condition 

P x = n x hjA x or 8P X = h/A x . (4) 

Thus the allowed values of P x are spaced h/A x apart. This situation is 
represented in Figure 5.9(a). The P^-axis extends to ±h/2a corresponding 
to the limiting condition X = 2a previously discussed, and, as explained, 
the wave functions at these two limiting points are identical. As we shall 
show later, values of P x lying outside the range —h/2a to +h/2a are 
equivalent to P^-values lying in this range. This region of P x space, which 
includes a complete set of non-equivalent points and is centered at P x = 0, is 
defined as the one-dimensional Brillouin zone. Its chief importance is that 
we need not, for most purposes, consider any P x -values lying outside of the 
Brillouin zone since all the eigenfunctions for the one-dimensional crystal 
are contained in the zone. In fact, because of the reciprocal relationships: 

(Size of Brillouin zone) « (Spacing in crystal) -1 

(Size of crystal) <* (Spacing in Brillouin zone) -1 

there are just as many allowed P^-values in the Brillouin zone as there are 
atoms in the crystal, this being the Brillouin-zone statement of the theorem 
of conservation of quantum states. This theorem is proved by noting that 
the number of atoms in the crystal is A x /a = N x and the number of allowed 
Ps-values is (h/a)/(h/A x ) = A x /a = N x also. Taking into account the 
two values of spin leads to 2N X quantum states in the Brillouin zone. 

The dependence of energy on P x has also been indicated for two energy 
bands in Figure 5.9(a). From this it is possible to make an energy level 
diagram for the allowed P^-values as is indicated on the right edge of (a). 
These same energy levels are plotted in (b), which is drawn to a reduced 
scale to show how the energy bands arise in the Brillouin-zone picture. 

The actual computation of energy as a function of P x can be carried out 
for idealized one-dimensional crystals, and treatments will be found in any 
modern text on the electron theory of solids and in some on quantum 
theory. 2 For electrons in actual crystals, however, laborious numerical 
calculations are, in general, required and only a limited number of cases 
have been dealt with. 



2 A simple case is treated in Chapter 14. 
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(b) DIAGRAM OF ENERGY THROUGH 
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BANDS ARISE FROM ALLOWED 
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(C) CYCLIC OR PERIODIC 
ARRANGEMENT OF 
X SCALE 



Fig. 5-9 — Structure of Brillouin Zone and Energy Bands for a One-dimensional 
Cyclic Crystal. (The Allowed Values of P x are Given by the Formula: P x = hn x \A X .) 

We must next extend the Brillouin-zone concept to three dimensions. 
For this case P x is replaced by a vector denoted byP, having components 
P x , P y , P z . Again we shall find that there is a limited volume in P-space 
to be considered which is bounded by planes corresponding to X = la. 
This limited volume of P-space, which may be defined as the most compact 
volume about P = containing all the non-equivalent values of P, is the 
three-dimensional Brillouin zone. Only values of P within this zone need 
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be considered and of these only a certain allowed set gives eigenfunctions 
satisfying the boundary conditions. We shall show that, for this allowed 
set, the theorem of conservation of quantum states once more applies. 
For three dimensions the Bloch function can be written as 

^p x p y Pz(x,y,z,t) = u Px , Py ,P,(x,y,z) X 

exp [(2wi/h)(P x x + P y y + P z z - & Px p v pj)] 

= ty p ( r , t) = u P (r) exp [(2ri/h)(P ■ r - Sjrf)] (5) 

where the second form makes use of the vector notation 

P = iP x+ jP y + kP z (6) 

r = ix +jy + kz (7) 

P-r = P x x + P y y + P z z (8) 

where i,j, and k are the customary unit vectors along x,y, and z axes (this 

use of i should not be confused with i = V~-i). We shall suppose that 

we are dealing with a simple cubic crystal with lattice constant a. (Later 

we shall comment on the modifications required for the treatment of a more 

complex crystal, such as the diamond structure.) By arguments similar 

to those used in the one-dimensional case, 3 it can be concluded that wave 

functions with P x = h/2a correspond to standing waves in the ^-direction 

and that, for any arbitrary fixed values of P y and P z , P x = h/2a and 

p x = -h/2a lead to the same wave function. Thus the Brillouin zone 

in" (P x , P y , Pz) space or P-space is a cube whose bounding planes are 

P x = ±h/2a, P y = ±h/2a, P z = ±h/2a. For more complex lattices, a 

more general method of attack must be used to find the Brillouin zone. 

However, in all cases the boundaries of the zone are determined by standing 

wave conditions like X = 2a. The planes which do the reflecting, however, 

will not necessarily be perpendicular to the x, y, and z axes and one must 

consider other possible planes. A procedure for doing this has long been 

used in studies of X-ray diffraction in crystals. According to this theory, 

the X rays will produce Bragg reflections from certain planes in the crystal 

when the wave vector (P/h) satisfies conditions equivalent to X = 2a where 

a is the spacing between the planes in question. In this way certain 

boundaries in P-space are found which enclose the Brillouin zone for more 

complex lattices. We shall not consider these more complicated cases, 

however, since it suffices for the present purposes to deal only with results 

which are generally valid for all types of lattices, and these can be illustrated 

adequately with the simple cubic model. 

3 For details of these arguments, which involve new mathematical complexities rather 
than new physical ideas, the reader is referred to F. Seitz, Modern Thoery of Solids, McGraw- 
Hill Book Co., New York, 1940; A. H. Wilson, The Theory of Metals, Cambridge at the Uni- 
versity Press, 1936; and Section 14.8. 
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(b) DIAGRAM OF CRYSTAL WITH 
CUBIC LATTICE 
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Fig. 5 -10— Three-dimensional Coordinates of the Brillouin Zone for a Crystal with 

Simple Cubic Structure. 
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If the simple cubic crystal consists of a rectangular parallelepiped with 
edges of length A x , A y , and A z parallel to the x, y, and z axes, we may find 
the allowed values forP by using periodic boundary conditions as before. 
In this case it is hard to imagine bending the crystal around in such a way 
that the faces on opposite sides are brought together as shown in Figure 
5.9(c). However, mathematically, the requirement that the wave func- 
tion be periodic with periods A x , A y , and A z in the three dimensions can 
easily be formulated. Experience in problems of this sort in theoretical 
physics has shown that the results are, in general, independent of the 
exact boundary conditions used at the surface of the sample. 4 For this 
reason, we shall not try to justify the periodic boundary condition on 
physical grounds but will indicate that substantially equivalent results 
would be obtained if we made the wave functions vanish on the boundary, 
corresponding to a potential field which holds the electrons inside the crys- 
tal. If we require that the wave function take on the same value at x and 
x + A x , we conclude by the same reasoning as before that 

P x = n x h/A x or 8P X = h/A x . (9) 

Since the values of y and z are left unaltered by this increase in x, they do 
not affect the reasoning. Similarly, by dealing with y and z individually, 
we conclude that 

Py = tlyhJAy OY SPy = h/Ay (10) 



and 



P z = n z h/A z or hP z = hj A,. (11) 



Thus the allowed points in P-space fall on the corners of a rectangular or 
orthorhombic lattice for which the unit cell has edges hj A x , h/A y , h/A z so 
that there is a volume of h 3 /A x A y A z or A 3 /FofP-space for each unit cell or 
allowed wave function. The reciprocal relationship between crystal and 
Brillouin zone in three dimensions thus is: 

(Size of Brillouin zone) <* (Unit Cell of Crystal) -1 

(Dimensions of Crystal) « (Dimensions in Array of allowed 

points in Brillouin zone) -1 . 

We shall have occasion to deal with the lattice of points in P-space 
defined by equations (9) to (11) in other connections as well and shall use 
the phrase basic P-lattice of the Brillouin zone to describe the array distinct 
points in the Brillouin zone specified by these equations. 

This lattice and the reciprocal relationships are represented in Figure 
5.10. The allowed points in the P z = plane are shown in greater detail 
in Figure 5.11(a). Here each distinct allowed point is shown by a solid 

4 We shall have further occasion in Chapter 7 to discuss the unimportance of the exact 
boundary conditions employed. 
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dot ; the open circles like P 2 on the edges of the Brillouin zone give wave 
functions identical with those of other points like P u Thus it is seen that 
there are just as many allowed points as there are rectangles in the plane 
(that is, there is one rectangle to the upper right of each allowed point). 
For three dimensions there are also just as many allowed points as there 
are elementary volumes h 3 /V. This is a general result, as is shown in 
Section 14.8, and may be expressed in the form 

density of allowed points in P-space = V/h 3 . (12) 

Since the volume of the Brillouin zone is (h/a) 3 , the number of allowed 
points is (h/a) 3 X (V/h 3 ) = V/a 3 . Thus we have the Brillouin-zone for- 
mulation of the Theorem of Conservation of Quantum States: 

The number of allowed points in the Brillouin zone is equal to the number 
of unit cells in the crystal. 

It is thus easy to see one relationship between the Brillouin zone and the 
energy band. If the crystal is assembled by varying the lattice constant 
as discussed in Section 5.3, then each atomic energy level spreads into a 
band of energy levels; the number of quantum states, usually one per 
atom, for each spin, then give rise to IN, quantum states in the band, 
where N, is the total number of atoms. These IN, quantum states corre- 
spond to the N, allowed points in the Brillouin zone combined with the 
two possibilities for the spin. If there is more than one atom per unit cell, 
the situation is somewhat more complicated. In this case the energy level 
will probably split into several overlapping bands each requiring a Brillouin 
zone to represent it. We have discussed the derivation of the theorem for 
the case of a simple cubic crystal. The theorem as stated is, however, 
actually applicable to any crystal lattice, provided that "unit cell" is 
properly interpreted. The unit cell is the smallest parallelepiped which is 
typical of the crystal and, if repeated over and over parallel to itself, would 
generate the entire crystal. For the diamond structure of Figure 1.3, the 
unit cell is a slanting parallelepiped containing two atoms and having a 
volume of one fourth the cube shown. The four electrons per atom in the 
valence-bond band of Figure 5.5 thus correspond to eight electrons per 
unit cell or enough to fill four Brillouin zones completely, using both spins. 
Thus the energy bands of interest in diamond-type crystals will correspond 
to Brillouin zones whose energies overlap in complicated ways. We shall 
indicate in Section 7.5 that certain approximations are made in treating 
this case. Some other features relating to overlapping energy bands are 
discussed in Chapter 12. 

The energy of the wave function depends, of course, on the value of P. 
In Figure 5.11(b) the dependence of energy upon P is illustrated for a 
hypothetical case, lines of equal energy being shown for points on the 
P z = plane exhibited in Figure 5.10. As is seen, the figure has a high 
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degree of symmetry, consistent with the symmetry of the simple cubic 
model being considered. The minimum energy lies at the center of the 
figure and the maximum value at the corners. The contours in Figure 
5.11(b) are spaced at approximately equal energy differences. In the 
three-dimensional diagrams, the constant energy lines are surfaces, nearly 
spherical ones near the center with more complicated shapes for larger 
energies. Finally, near the corners of the cube, the surfaces once more 
become nearly spherical. It does not always happen that the lowest 
energy occurs at the center. As we shall see later, so far as the behavior 
of the electrons as current carriers is concerned, the minimum may equally 
well be at the corners and the maximum at the center. 

It has been stated that points on opposite faces of the Brillouin zone, for 
example Pi and P2 in Figure 5.11, for which the crystal momenta differ by 
AP X = ±h/a, AP V = AP Z = 0, have the same wave function. This means 
that increasing P x beyond h/2a is equivalent to re-entering the zone on the 
opposite face and proceeding inwards. This result can be shown to be 
correct by dealing mathematically with the Bloch wave functions. It can 
also be illustrated for the analogous case of the coupled oscillators of 
Figures 5.3 and 5.6. In Figure 5.12, three modulating waves are shown 
moving past a group of harmonic oscillators according to a scheme of 
presentation similar to Figures 5.3 and 5.6. However, in this case only 
the bobs of the oscillators are shown. Part (a) shows the conditions 
P x — ±h/2a or 1/X = l/2a. The two waves are drawn as though they 
progressed to right and left in successive diagrams. In part (b) we show 
a wave with P x = {h/la) + jr(h/2a), that is, a momentum lying beyond 
the edge of the zone by 16§ per cent. Its wave pattern is also shown 
as though it advanced to the right reading down the page. In addition, 
on part (b) we show P x = — {h/2a) + \{h/2a); that is, a wave moved in 
from the left edge of the zone by the same amount that the other lies 
outside to the right. Since it is characterized by a negative value of P x , 
this wave is to be considered as moving to the left. As is seen, the motion 
of the oscillator bobs is the same for the two waves, showing that proceeding 
beyond the zone boundary is equivalent to entering it on the opposite side. 
We could thus equally well use for the zone the boundaries P x = 0, P x = h/a, 
P y = 0, P y = h/a since the points in this area would give the same set of 
modulation waves as Figure 5.11(a). Such a rearrangement has advan- 
tages for some purposes and is employed in Section 7.5. 

P-space is periodic in the sense that adding ±h/a to any component of P 
gives rise to the same wave function. For this reason points outside of the 
Brillouin zone are always equivalent to points inside the zone and it is 
never really necessary to consider P values lying outside the Brillouin zone. 
One Brillouin zone, however, gives only enough quantum states to accom- 
modate two electrons per unit cell; whereas most crystals contain many 
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more than two electrons per unit cell. This difficulty may be solved by 
using the same points in the Brillouin zone many times over, simply 
specifying in each case which energy band is concerned. Another pro- 
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(b) WAVES CORRESPONDING TO P = (lij + id?) AND 
p= -(25) + e(2a) PRODUCE THE SAME MOTIONS OF 
THE BOBS AND, IN THE CRYSTAL, THE SAME 
WAVE FUNCTIONS FOR ELECTRONS 

Fig. 5-12 — Values of Momentum Lying Outside the Brillouin Zone are Equivalent 
to Values Lying Inside Obtained by Subtracting h/a from P. 

cedure, which is more complicated, is to use the remaining portions of 
f-space outside of the Brillouin zone for other bands. There is, in fact, a 
definite procedure for doing this and the more complicated volumes 
involved in P-space are referred to as the second, third, etc., Brillouin 
zones. (The dashed lines of Figure 5.7(d) are associated with this pro- 
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cedure.) For our purposes, however, it is simplest to use only the first 
Brillouin zone, which is the one already described. For a given point in 
this zone there is a definite modulation wave, exp (2iriP ■ r/h) superimposed 
on the atomic wave function. By using the same modulation wave but 
different atomic functions, wave functions in the different energy bands are 
produced. Some examples of these have been shown in Figure 5.7(d). 
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Fig. 5—13 — Energy Band Diagrams for Several Crystal Types. 



5.6 ENERGY BANDS FOR METALS, INSULATORS, AND 
SEMICONDUCTORS 

There are characteristic differences between the energy bands of metals, 
insulators, and intrinsic semiconductors in terms of which their electrical 
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conductivity can be understood. As we shall see later, when all the 
quantum states of a Brillouin zone are occupied, the zone cannot participate 
in conduction. In the case of a metal, the energy bands overlap in such a 
way that, as occupied by the available electrons, some Brillouin zones are 
left partially filled. On the other hand for pure diamond, one set of 
Brillouin zones is occupied by valence electrons, and the next higher set 
(the conduction band) is left entirely empty. The energy gap between 
them is so great that electrons are not thermally excited, and diamond, 
therefore, has either completely filled or completely empty Brillouin zones 
and is an insulator when in thermal equilibrium at room temperature. On 
the other hand, germanium has its valence-bond bands and conduction 
bands so close together that it would exhibit appreciable conductivity at 
room temperature even if pure. The diamond form of tin, called gray tin, 
which is stable below room temperature, is probably either a metallic 
conductor with overlapping energy bands or at least an intrinsic semi- 
conductor of very high conductivity. The energy gaps for tetravalent 
elements are thus thought to be: 

Carbon Tin 1 Tin 

(Diamond) Silicon Germanium Gray Metallic Lead 

6 to 7 ev 1.11 ev 0.72 ev 0.1 ev (overlapping bands) 

The donors and acceptors, which can bind holes and electrons as described 
in Chapter 1, give rise to energy levels which lie in the energy gap. A 
quantum-mechanical treatment of these energy levels and their relationship 
to energy bands is given in Chapter 9. 

The energy level schemes for the various cases just discussed are shown 
in a conventional form in Figure 5.13. We shall return to a discussion of 
the statistical distribution, under thermal equilibrium conditions, of 
electrons among such energy levels at the close of Chapter 10. 

Problems 

1. Carry out an analysis, similar to that for Figure 5.6, for four oscillators 
and for eight oscillators. Discuss the differences that would be introduced 
if the number of oscillators were changed to an odd number. 

2. The patterns shown in Figure 5.6 resemble traces seen on oscillo- 
scopes. Show that, if the horizontal plates are driven at frequency/ and 
the vertical plates at nf, patterns like the dashed lines of Figure 5.6 will be 
obtained. If the y plates have also a small signal at/ and 90° out of phase 
with the horizontal plates, even the perspective will be simulated. What 
will happen if the y frequency is slightly different from «/? 

3. Assume a force constant for the vertical spring and an initial tension 

1 G. Busch, J. Wieland and H. Zoller, Conference on the Properties of Semiconducting 
Materials, University of Reading, July 10-15, 1950. 
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in the coupling springs; assume also that the masses of Figure S.6 slide on 
frictionless vertical rods. Work out the frequency of the waves for small 
amplitude disturbances. 

4. Compare qualitatively the distribution of allowed frequencies for 
"clamped end" boundary conditions and periodic boundary conditions, 
for an odd number of coupled oscillators. 



CHAPTER 6 

VELOCITIES AND CURRENTS FOR ELECTRONS 

IN CRYSTALS 

This chapter serves to complete the Brillouin zone description of the 
properties of the Bloch wave functions; with it we conclude the general 
description of the behavior of electrons moving in the periodic field of a 
perfect crystal in the absence of external fields. The new aspect added 
here is the velocity of motion associated with each quantum state in the 
Brillouin zone. In a text designed exclusively for physicists, this topic 
would be treated principally on the basis of the postulates and operations 
of quantum mechanics. While in our treatment we refer to these quantum- 
mechanical methods, we emphasize primarily the analogy with other 
problems involving wave motion. At the end of this chapter we are 
within two steps of the elementary treatment of conductivity: In Chapter 7 
we consider the effect of electric and magnetic fields on altering the behavior 
of electrons, and in Chapter 8 we introduce the idea of random processes 
which produce resistance. 

6.1 THE VELOCITY AND CURRENT CONCEPTS 

When an electron is moving in the crystal in the manner prescribed by 
one of the Bloch wave functions, it has an "average" velocity of motion 
which gives rise to a current. In fact, for each quantum state m the Bril- 
louin zone there is a certain average velocity, the one which an electron 
would have if it occupied that state. It is the aim of this chapter to discuss 
the relationship of this average velocity, which will be denoted by the 
vector symbol v, to other properties of the Bloch wave functions. 

As described in connection with Figure 5.7, the complex Bloch waves 
flow through the crystal. However, the electron probability density is 
the same in every unit cell so that no apparent flow can be detected by 
examining the distribution of the probability density for the electron in 
space. This brings us to the important matter of interpreting the meaning 
of the Bloch wave function in terms of more familiar descriptions of the 
electron's behavior. We shall find that the theory indicates that the 
electron is actually moving through the crystal with a definite average 
velocity. It should be pointed out at once that this involves no contradic- 
tion with the uniform probability distribution. In a number of other 
cases, the entity which flows may show no shift in its density distribution. 
For example, a rotating flywheel may look perfectly stationary, yet the 
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iron which is uniformly distributed around the rim is actually moving. 
The energy density along a power line may be uniform and unchanging, 
yet power flows. (An example of this sort is presented in Section 15.3.) 

The Bloch function represents a state of motion for the electron in which 
it is equally likely to be in any unit cell of the crystal while at the same 
time it has an average velocity of motion through the crystal. The reason 
that this motion does not carry it out of the crystal after a time is a mathe- 
matical one associated with the periodic boundary conditions. These cor- 
respond (as shown in Figure 5.9(c) for the one-dimensional case) to a 
cyclic crystal in which flow out of one end is equivalent to flow in at the 
other. Thus the electron flows indefinitely through the crystal. 

This flow can be expressed as a current density in a form useful for later 
conductivity considerations. Thus, in the one-dimensional case, if the 
average velocity of motion is v x , then the electron traverses one cycle about 
the loop in a time A/v x . This means that the. charge — e of the electron 
flows around the loop v x /A times per second and hence that the current is 
/ = —ev x /A = ( — e/A)v x , the second form showing that the current can 
be expressed as charge per unit length, ( — e/A), times average velocity. 
In three dimensions, the current density vector or charge per unit area per 
unit time can be expressed similarly as 

/ = (-e/F)v, (1) 

that is, as the charge per unit volume times the vector velocity. 1 The 
derivation is as follows: If the electron has an average vector velocity v 
with components v x , v y , v z , then it makes v x /A x traversals in the ^-direction 
per unit time; on each traversal it carries — e across the end face of the 
crystal giving rise to a current of —ev x /A x . Since the end face has an area 
A y A z , the current density in the ^-direction is given by the formula 

I x = (-ev x /A x )/A y A z = {-e/V)v x , (2) 

where V = A x A y A z is the volume of the crystal. Similar calculations for 
the y and z axes then establish the general formula. Actually, the argu- 
ment given here, which serves to introduce the formula for current density 
and to illustrate the idea of the average velocity, is in a sense the reverse of 
the one logically employed in the theory to find the average velocity. This 
will become apparent from the method of attack used to evaluate the 
average velocity. 

6.2 THE POYNTING VECTOR OR DENSITY-FLOW METHOD 

Since no motion can be detected by computing the change in the prob- 
ability density, other methods of determining the velocity must be em- 

1 Even before the electron crosses the surfaces, it produces a displacement current given by- 
Equation (1 ) . See, for example, W. Shockley, /. App. Phys., 9, 635-636 (1938). 
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ployed. These are directly available in the theory of quantum mechanics, 
and a definite procedure is given for calculating the average current flow 
from a wave function. We shall not present this method here 1 " but shall 
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Fig. 6-1 — Comparison of the Calculation of Average Speed for an Electron with the 
Calculation of Flow of Electromagnetic Energy. 

describe instead its analogue in terms of power flow in electromagnetic 
theory. The relationship is shown in Figure 6.1. In Section 15.3 the 
procedure is actually carried out in detail for an artificial lumped-constant 
transmission line having no loss. For this case, the energy per unit length 
is computed for a line with a particular running wave, the running wave 
being found by the procedure leading to "waves in periodic structure" 
u Derivations are given in Sections 14.3 and 15.4. 
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shown in the figure. The equation for power flow is then simply obtained 
by evaluating the current flowing from left to right at some point on the 
line and multiplying this by the voltage and proper phase factor. This 
familiar expression for power flow is analogous to the Poynting vector for 
electromagnetic waves or to the quantum mechanical expression for 
average current. We next argue that the power flow along the line is due 
to the flow of its energy density. Consequently, if its velocity of flow is v, 
all the energy in a length v of line flows past a point in unit time, and, 
therefore, power is energy density times speed of flow as shown in Fig- 
ure 6.1. If the line has no phase distortion, that is, all frequencies travel 
at the same speed, or, in other words, there is no dispersion, then this rate 
of flow always turns out to be the velocity of the waves. However, if the 
speed of the waves depends on the wave length, then the power does not 
flow at the same speed as the waves but instead at a speed given by the 
formula 



dv 



V - 



d{\/\) 



(1) 



This expression equals the speed of the waves only for the case of constant 
wave speed for which 

dv dc{\/\) 



v = c/\ so that 



d{l/\) d{\/\) 



(2) 



where c is the speed of the waves. 

When the analogous procedure using the expressions for electron current 
density are employed as indicated in Figure 6.1, precisely the same formula 
is obtained for electron probability density- as for electrical energy for the 
one-dimensional case. However, since the relationships 



hv = S and P x = h/\ 
apply, we can rewrite the formula in the form 

dv d& 

</(l/X) = 1F X ' 



v = 



(3) 



(4) 



This formula is identical in form with that obtained classically, for which 
p = mv and & = mif/l = p 2 /2m so that dS/dp = p/m = v. However, 
the physical meaning is different, since, for electrons in a periodic potential, 
6 does not equal p 2 /2m. 

There is a fundamental explanation for the fact that the same formula 
v = dv/d(l/\) is obtained for both electromagnetic waves and electron 
This explanation is based on the use of wave-packets and group 



waves. 



velocity, topics which are discussed in the next section. 
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Fig. 6-2 — Motion of a Minimum-uncertainty Wave-Packet in Free Space. 
The product of uncertainty in position and momentum at / = for this packet has 
the minimum value Ap Ax = /i/4tt permitted by the Heisenberg uncertainty relation. 

(a) Plots of the real part of the wave function ^{x,t) for several instants of tim;. 
Since this is a one-dimensional wave-packet, ^ has the dimension of cm -1/5 The imagi- 
nary part of the wave function is similar in shape and i(x,t) is shown by the envelope 
lines. The velocity of the group is twice as great as the phase velocity of the waves 
which compose it: while the crest of the waves advance one wave length between t 
and 4, the group advances two wave lengths. The packet is composed of momenta 
varying by about ±8% from the mean value. This spread in component velocities 
causes the packet to spread appreciably in length during the time from k to /„ the 
higher momentum components gathering near the front of the packet and giving the 
wave function a shorter wave length while the lower components similarly gather at 
the rear. 

(b) The continuous progression of the lines of constant complex phase angle is 



6.2] 



WAVE-PACKETS AND GROUP VELOCITY 



161 



W ,3 X40 



I I I 1 | I I I I | I I I I | I 




40XI0~ ,& 



-5XI0" 7 



5 10 15 20 

DISTANCE, X, IN CENTIMETERS 



25 ICT 7 X30 



Fig. 6-2 — Continued. 
shown by the dashed lines. When the angle is a multiple of 2x, the wave function is 
real and positive, corresponding to the crests in (a). The line for maximum probabil- 
ity amplitude advances with twice the phase velocity and the lines at which the 
probability amplitude falls to l/<? 2 of the maximum are seen to spread out. The bend- 
ing back of the lines of constant phase near x = for the larger times corresponds to 
motion to the left, which results from the fact that there is some probability that the 
electron will have a negative momentum. 

The average velocity corresponds to 1 electron volt of energy with a velocity 5.92 X 
10 7 cm/sec and a wave length of 12.2 X 10~ 8 cm. The value of Ax at / = was 
arbitrarily chosen as 12.2 A in computing the packet. This gives Ap = A/4xA* = 
pi/4w where pi corresponds to 1 ev. The spread in momentum increases the energy 
6cm p\/2m to (pi + Ap 2 )/2>» = p\(\ + (1/16tt 2 ))/2« = 1.0063 ev. 

For earlier times than / = 0, the packet also spreads out in a way symmetrical to 
that for later times. The theory of this wave-packet is discussed in Section 15.5. 
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6.3 WAVE-PACKETS AND GROUP VELOCITY 

We shall next interpret the velocity formula in terms of wave-packets 
and group velocities, both of which are useful concepts in wave mechanics. 
Accordingly, we suppose that instead of sending a single frequency down 
the transmission line we send a pulse signal at a certain carrier frequency. 
This pulse may be regarded as the result of superimposing a number of 
single-frequency waves. The pulse localization results from the construc- 
tive interference of these waves at the location of the pulse and the destruc- 
tive interference elsewhere. In general, these waves travel at varying 
speeds, and thus the interference pattern changes. As a result of the 
shifting of the interference pattern of the group of single-frequency waves, 
the pulse travels at a speed different from that of the individual waves. 
The terminology used to describe this situation is as follows: The pulse is 
said to be a wave-packet, which is made up by the constructive interference 
of a group of waves. Each of these individual 
waves moves with its own velocity called the 
phase velocity, whereas the packet moves with 
the group velocity. If all the waves of the group 
travel with the same phase velocity, then the 
wave-packet moves with them with unaltered 
shape. However, if the phase velocity varies with 
the wave length, then the interference pattern 
shifts and the wave-packet may go either faster or 
slower than the waves which build it up. When 
the mathematics of this interference problem is 
analyzed (see Chapter 15), it is found that the 
group velocity is given by the formula 

o = • (1) 
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d6 
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V = 7 P 6 (P) 



The reason that the same formula results from both 
the power flow calculation and the group velocity 
calculation may be understood from the follow- 
ing argument: In the case of the wave-packet the 
energy is localized in a certain region of the trans- 
mission line, hence it must flow at the same speed 
as the wave-packet. On the other hand, when the 
wave-packet is passing a certain point on the line, the situation locally is 
much the same as for the state of steady power flow, and energy will be 
flowing past at the speed determined from the steady flow calculation. 
Hence the speed of flow of the steady-state power must be given by the 
group-velocity formula. (The equality for the artificial line is given in 
detail in Section 15.3.) 



Fig. 6-3— Outline of 
Argument Used to De- 
rive Velocity of Elec- 
tron from Group-veloc- 
ity Concept. 
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This same procedure may be used to analyze the behavior of the Bloch 
waves. The argument is outlined in Figure 6.3. In order to determine the 
velocity in one Bloch function, we combine it with others of approximately 
the same value of P x . In this way we get a localized wave-packet. (Such 
a wave-packet for an electron is shown in Figure 6.2, and its properties are 
discussed in the associated text.) We then argue that since all the wave 
functions are very similar (that is, only a small difference in their P x 
values), they correspond to about the same average velocity for the elec- 
tron, and thus the packet which represents the average of their effects must 
also have the same velocity. It is then a straightforward mathematical 
problem to evaluate the group velocity from a study of the interference 
patterns. The rasult is quite general, and, as derived in Section 15.2, it 
applies to any set of interfering waves. It agrees, of course, with that 
obtained by the density-flow method. We shall next discuss the conse- 
quences of the formula for a one-dimensional case. 



6.4 VELOCITIES FOR QUANTUM STATES IN 
BRILLOUIN ZONES 

As pointed out previously, the relationships & = hv and P x = k/\ 
enable us to transform the group velocity equation to 

v = dS/dPx (!) 

for the one-dimensional case. This result is in agreement with the classical 
Hamiltonian expression. In the classical theory of analytical dynamics 
the energy is written in the form "3C(pi,qi) where the quantities j» are 
coordinates and the quantities pi are momenta. For a single particle the 
coordinates and momenta are conveniently chosen as 



q x = x, 



<li = y> 



q 3 =• Z 



dx 



dy 



dz 



Other choices can be made for special problems, such as the problem of 
planetary motion. The relationship between classical and quantum 
mechanical Hamiltonian expression is discussed in Section 14.3. 
ton's equations of motion are 



Hamil- 



li = 



d'X , . atJC 

and Pi = — - — 

dpi F dq t 



For the case of an electron moving in one dimension in a constant potential 
energy °0 , both quantum and classical mechanics give kinetic energy = 
p 2 /2m, so that the total energy is 6 = °0 + p 2 /2m, and the group velocity 
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\s v = p/m or p = mv, the classical result. 1 The value °D 05 although it 
contributes to the frequency of the Bloch wave, through v = 5/ K does not 
affect the group velocity. In other words, the group velocity is independent 
of the zero selected for the energy scale whereas the phase velocity is not. 
Since only the group velocity contributes to the current, this variability 
of the phase velocity is, however, not important. 
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Fig. 6-4 — Energy and Velocity in Brillouin Zones. 



For definiteness, some of these results are shown graphically in Figure 6.4. 
Here energy and group velocity are plotted for points along the Praxis. 
The upper diagram shows the energy for two energy bands, and the lower 
diagram shows the velocity for the upper energy band. In both cases 
dotted lines show the periodic dependence of energy and velocity on P x , 
points outside the zone being equivalent to points inside obtained by adding 
or subtracting h/a from P x . 

1 We use p for momentum in the ordinary classical or quantum-mechanical sense and P for 
the crystal momentum. 
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In three dimensions, it is necessary to find a vector velocity with com- 
ponents similar to v = d&/dP x for one dimension. These are shown in 



Chapter 15 to be 



or, in vector notation 



v x = d&(P x ,P y ,P z )/dP x 
v y = d&(P x ,P yy P,)/dP y 

V t = dS(P* t Py>P,)/dP, 



v = v>§(f). 



(2a) 
(2b) 
(2c) 

(3) 



-CONSTANT 
/i ENERGY 
( 'CONTOURS 



Geometrically, this relates v to the energy surfaces in the Brillouin zone 
as follows: To find the velocity corresponding to P, construct in P-space a 
line perpendicular to the surface of 
constant energy passing through P. 
Proceed along this line in the direc- 
tion of increasing energy and evaluate 
d&/d\P\ along this line. The group 
velocity corresponding to P has the 
direction and magnitude so obtained, 
it being, of course, supposed that the 
P x , P y , P z axes of the Brillouin zone 
are parallel to x, y, z axes of the crys- 
tal. 2 This process will lead to a vector 
distribution of velocities in the P x P y 
plane of a Brillouin zone like that 
shown in Figure 6.5. Here the lengths 
of the arrows indicate the velocity 
corresponding to the point in P- 
space from which they originate. 
Since P and v have different dimen- 
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Fig. 6-5 — Dependence of Velocity upon 

Crystal Momentum. The vectors give 

direction and relative magnitudes of the 
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sions, only the d.rections and the ^ ^ ^ ^ ^ eRergy con _ 

relative magnitudes of the arrows are tours are spaced at equal energy inter . 

significant. vals with maximum energy at the 

The evaluation of the veloci ty com- corners.) 

pletesthedevelopment of the Brillouin 

zone description of the properties of Bloch wave functions as outlined in 
Figure 5.1. To summarize briefly the present aspect, we find that an 
electron in a Bloch function moves so that it is equally likely to be anywhere 
in the crystal. It has an average velocity of motion v which carries it 
cyclically through the crystal over and over again. In this connection, it 
should be noted that as the electron moves through the crystal in accordance 
with the velocity shown in Figure 6.5, its P value does not change. That is } 

8 We have merely defined in words the meaning of V P &, the gradient of & in P -space. 
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the vector in Figure 6.S shows a velocity in ordinary space, not in P space. 
The value of P is changed by externally applied forces or scattering proc- 
esses, which are considered later. 

We can reach a conclusion bearing on conductivity from the matters just 
discussed, which we shall present here as a prelude to the following chapters, 
where we shall again derive it as a limiting case: From the properties of the 
Brillouin zone, we conclude that a full zone can carry no current. This 
result follows simply from symmetry. For all lattices it is found that the 
energy takes on equal values for P and — P and, as a consequence of this, 
the velocities for P and — P are equal and opposite. In a full zone, since 
every allowed state is occupied, the electrons always cancel off their veloci- 
ties in pairs so that there can never be any velocity unbalance resulting in 
an average velocity for the electrons as a whole. (A further discussion of 
the additivity of the charge and current densities is given in Sections 15.7 
and 15.8.) 



CHAPTER 7 

ELECTRONS AND HOLES IN ELECTRIC AND 
MAGNETIC FIELDS 

The chief aim of this chapter is to show that an electron in an otherwise 
empty conduction band, and a hole in an otherwise full valence-bond band 
behave, respectively, much like negative and positive particles in free space, 
when acted upon by electric and magnetic fields. The treatment is 
founded on the basic law for the rate of change of crystal momentum when 
the electron is subjected to a force. This law, although in close analogy 
with Newton's second law F = ma, must be derived on the basis of quantum 
mechanical arguments such as those given in Chapter 15. In Section 7.3 
we give a derivation, which, although not rigorous, is helpful in making the 
law seem reasonable. The method of derivation followed in Chapter 15 is 
indicated in general terms in Figure 7.1, together with some consequences 
of the law which are treated in various sections of this chapter as indicated 
on the figure. 

7.1 THE EFFECT OF A FORCE P = F 

When the field in which the electron moves in the crystal is modified by 
adding, to the periodic potential **& of the crystal, terms representing applied 
electric and magnetic fields, the Bloch wave functions are no longer solu- 
tions of the time-dependent Schroedinger's equation. However, it is found 
that if the vector P in the Bloch functions is varied with time according to 
the law 



dP 

dt 



= P = F, 



(1) 



then the resulting wave function is a good solution. The expression to be 
used for F is the classical one. The electric field £ exerts a force —eE and 
the magnetic field H exerts a force at right angles to both H and the average 
velocity v of the electron. This leads to the expression for the force on the 
electron 



F = -e{E + v X H/c) 

where c is the speed of light. 1 

1 For the meaning of v X H for a particular case, see Figure 8.7. 
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The similarity between equation (1) and Newton's second law of motion 
F = ma = d(mv)/dt is the basis for calling P the crystal momentum. 

If the force is steadily maintained, the point in the Brillouin zone repre- 
senting the electron (that is, "representative point") moves steadily in the 
direction prescribed by the force. When it reaches the surface of the zone, 
it becomes equivalent to a point on the opposite side and continues into 
the zone from the equivalent point. 
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Fig. 7-1 — Outline of Derivation of P = F and the Consequences P — F. 



The result P = F implies that all points in the Brillouin zone represent 
possible quantum states instead of just the basic P-lattice of equations (9) 
to (11) of Section 5.5 which arose as a consequence of the periodic boundary 
conditions. The reason for this difference is that, in general, the applica- 
tion of electric and magnetic fields alters the periodic character of the 
boundary conditions. Thus in a straight wire carrying a current the volt- 
age at one end of the wire will not be the same as at the other, and it is 
meaningless to require periodic boundary conditions to apply between the 
two points. However, it is actually possible to bend a wire into a circle, in 
which case periodic boundary conditions should apply to the coordinate 
which runs around the circle. It is also possible to apply an electric field 
to such a wire by using it as a short-circuited secondary winding on a trans- 
former. A model of this sort is treated in Section 15.9 and it is found that, 
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although the induced electric field produces changes in the wave function, 
it does not change the number of wave lengths of the wave function around 
the circle. However, it does change the energy of the electron and the 
current carried by it in precisely the way they would change if the periodic 
requirements on the boundary condition were disregarded and the wave 
function altered according to the formula P - F. ^ 

The details of the argument involve mathematical procedures which 
cannot well be described in simpler terms. However, the result is that, for 
the periodic model, all energies and velocities vary just as if the lattice of 
points representing allowed quantum states shifted under an electric field 
in accordance with the law P = F. We shall accordingly treat the effect of 

fields in this way. 

The same result may also be obtained by considering a wave-packet 
composed of Bloch functions all having approximately the same value of P. 
When electric and magnetic fields act on the packet, it is altered so that the 
average value of P changes according to the law P = -e(E + v X H/c). 
Corresponding results are obtained if the wave-packet is made to represent 
a hole. (Sections 15.6 to 15.8.) • 

As a consequence of these facts, we are justified in taking the law P - F 
as being the proper generalization of Newton's Second Law of Motion for 
electrons in a crystal. Since the application of this law to all electrons in 
the states of a Brillouin zone produces a flow of all the occupied states in 
the zone, we may also consider it to apply to the unoccupied states as well. 
No error is introduced by this process, since, if the states are empty, we are 
not concerned with them directly and, if they are occupied, the behavior is 
correctly given. The procedure of considering that the empty states also 
vary according to the same law, however, has advantages in analyzing the 
behavior of holes and is used in Section 7.6. 

Finally it may be remarked that in the presence of a magnetic field the 
relationship between classical momentum^ and velocity v, and between the 
corresponding quantum mechanical operators, is no longer p = mv. The 
correct relationship is discussed in Section 15.6. It is shown, however, 
that the relationships 

S - S(P) and v = VpS(P) (3) 

are unaltered by a magnetic field so that equations (1), (2), and (3) con- 
stitute a complete set of equations for determining the behavior of an 
electron in applied fields, at least for linear terms in E and H. 

7.2 THE THEOREM OF THE CONSERVATION OF 
QUANTUM STATES 

The flow of points in the Brillouin zone is compatible with the theorem 
of the conservation of states for the case of electric and magnetic fields. 
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We shall treat first the electric field, returning to magnetic fields later. 
In the case of the electric field the result is obvious. Since the force is 
— eE and the same for all values of P, the lattice of allowed quantum states 
simply moves as a solid with translational motion through the Brillouin 
zone. It is evident that this leaves the total number of allowed points in 
the Brillouin zone unchanged and also has no effect on the density of their 
distribution. 1 

7.3 THE PRINCIPLE OF THE CONSERVATION OF 
ENERGY FOR AN ELECTRIC FIELD 

If a particle is accelerated by a force F, we should expect its gain in 
energy to be equal to the work done on it by the force. The power furnished 
to the particle by the force is v ■ F and the rate of change of energy is 
g = Vpg . P. According to the laws we have discussed P = F and v = 
V/>&, so that these two powers are equal, and conservation of energy applies 
to the behavior of an electron subjected to a force. 

Although energy is conserved when the electron is acted upon by a steady 
electric field, its behavior in other respects is strikingly different from that 
of a classical particle in free space. To a large degree, the particularly 
unusual features, which we shall shortly describe, are suppressed by the 
random processes of scattering which are always active in real crystals 
(except metals in the superconducting state). However, it is worth while 
to present these results because they are direct, logical consequences of the 
theory presented in Figures 6.4 and 6.5 and, in addition, they illustrate 
principles which we shall later use. 

Accordingly, we shall consider the motion of an electron in a perfect 
crystal under the influence of a uniform force F in the x-direction. If this 
electron is initially at the lowest state in the upper band at P x = in 
Figure 7.2, then its value of P x will increase toward the right at a constant 
rate P x - F x . Its velocity will initially increase to the right also, and, in 
fact, directly in proportion to the momentum so that, as long as the curve 
of part (b) may be well approximated by a straight line, the velocity 
increases uniformly in time as it would for a classical particle, such as a 
falling body. For these conditions the behavior of the electron can 
actually be treated in a classical fashion, as will be discussed later. How- 
ever, after a certain length of time the velocity will reach the maximum of 
the curve in part (b) and, thereafter, will decrease. Thus in the region 
where dv x /dP x is negative, the electron behaves as if it had a negative mass; 
that is, in this region the continued application of a force to the right 
actually produces a deceleration to the right and the particle slows down. 

1 It also leaves the wave functions orthogonal. 
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This behavior has no classical analogue in the dynamics of particles and is 
due to the wave-mechanical laws governing the electron. If the field is 
maintained long enough, P x reaches the value h/la, after which its velocity 
becomes negative as indicated by the dotted line, and the electron moves 
against the force. (This behavior is, of course, equivalent to moving P x 
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back to an interior point on the opposite side of the zone in accordance with 
the periodic property of P x .) Finally, the velocity reaches a maximum 
value in the negative direction, corresponding to the minimum of curve (b), 
and then decreases in magnitude to zero once more, after which the cycle 
repeats itself. In other words, the particle oscillates back and forth. If a 
wave-packet were made up using wave functions, all having nearly the same 
momentum, this wave-packet would oscillate back and forth in the crystal. 
In fact, the position of the wave-packet as a function of time would be a 
curve of exactly the same shape as that which gives its energy as a function 
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of momentum, a result which can be seen from the following equations: 
* =Joxdt = J(d&/dP x )dt = j \d$/dP x )dP x /F x 



&(P*)/F X + const = &{F x t)/F x + const 



(1) 



since if F x is applied at t = 0, P x = F x t and dt = dPJF x . Since the work 
done on the particle is xF x , this equation, in addition to giving the position 
of the electron as a function of time, is also an expression of the conserva- 
tion of energy. The period of the motion is h/aF x , the time required for 
P x to traverse the Brillouin zone. 

The oscillatory aspect of the electron's behavior in this case can be 
profitably considered in calculating the probability of exciting an electron 
from one band to the other under the influence of a strong applied field 1 — a 
subject not within the scope of this presentation, however. Instead, the 
discussion just given should be regarded as intended to illustrate features 
of the theory of Brillouin zones. As we shall see, collision processes intro- 
duce random transitions of the electron from one state to another in times 
less than the period of the motion so that the anomalous behaviors con- 
sidered here play no important role in ordinary conduction processes. 

7.4 THE EFFECT OF A MAGNETIC FIELD 

We shall next consider the forces exerted by a magnetic field upon a 
moving electron and the consequent changes in momentum. We shall treat 
the restricted case of a crystal momentum in the P z = plane for a 
magnetic field in the z-direction. In Figure 7.3(a) we represent the P x P y 
plane of a Brillouin zone with the energy contours and velocity shown as in 
Figure 6.5. The velocity, it will be recalled, is proportional to the gradient 
of 5 in the zone and is thus perpendicular to the energy contours; the 
direction and relative magnitude of the velocity are indicated by arrows in 
the figure. 

When a magnetic field is applied in the +z-direction, the vector force 
— ev X H/c on the electron will result in a changing momentum. Since 
the force is perpendicular to v, the momentum will change as indicated by 
the arrows in (b). This results in a flow of representative points along the 
energy contours. Thus, the electrons gain no energy from the magnetic 
field — a result consistent with the fact that the force produced by H is 
perpendicular to the motion of the electron and hence does no work on it. 
As discussed in connection with the Pauli principle, this flow should not 
change the density of the points and should thus be incompressible. This 
necessary result comes directly from the formula for the change of momen- 

1 C. Zener, Proc. Roy. Soc. 14SA, 523-529 (1934). 
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turn. Thus if we consider the lines Li and L 2 in Figure 7.3(b), we can 
show that the flow of particles across L\ is just the same as across L 2 ; for 
example, if Li is half as long as Z^, then the velocity of an electron at L\ 
will be twice that at L 2 , and consequently the force and rate of change 
of momentum will be doubled. Hence, equal volumes of momentum 
space will flow across L\ and L 2 in equal times. Accordingly, if the 
quantum states are initially uniformly distributed, there will be no tendency 
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for them to accumulate between Li and L 2 since the flow across Z-i and Z, 2 
are equal. (For three dimensions this result can be put in the form that 
divergence of P is zero, 1 so that the general conclusion is that a magnetic 
field by itself serves only to move the points about on their energy surfaces 
without producing any accumulations or affecting the properties of the 
equilibrium distribution.) 

These general results for the change in P for an electron in a Brillouin 
zone under the influence of electric and magnetic fields will next be applied 
to the behavior of a single electron in an otherwise empty conduction band 
and a single hole in an otherwise full valence-bond band — still moving in a 
perfect crystal, however, so that random processes are prevented. 

1 Vp • (VpS X H) = H(V/>X V/>£) + V f S • (Vp X ff) =0-0. 



174 EFFECTS OF ELECTRIC AND MAGNETIC FIELDS 17.5 

7.5 THE BEHAVIOR OF AN EXCESS ELECTRON: EFFECTIVE 
MASS AND REARRANGEMENT OF THE BRILLOUIN ZONE 

In order to apply the theory just discussed to problems of conductivity 
and Hall effect, it is necessary to deal with specific properties of the energy 
surfaces in the Brillouin zones. The calculation of energy surfaces has 
actually been carried out for a number of metals by using approximate 
methods. However, no detailed calculations are as yet available for semi- 
conductors. For this reason, a certain amount of guess-work is involved in 
dealing with these cases. Fortunately, however, there are general prin- 
ciples, like those involved in determining the shape and size of the Brillouin 
zone, which can be used as guides. These principles are based largely on 
symmetry conditions which the surfaces must satisfy in order to be con- 
sistent with the basic symmetry of the crystal. In addition there are some 
quantitative calculations on certain features of the energy surfaces which 
help to establish numerical magnitudes. Finally there are experimental 
data which can be used to test the reasonableness of the assumptions con- 
cerning the shape. 

On the basis of these arguments, we shall proceed by assuming that the 
diamond-structure semiconductors have approximately spherical energy 
surfaces at the highest energies of the valence-bond band and the lowest 
energy of the conduction band, these being the regions of interest for con- 
ductivity. 1 Furthermore, we are interested only in energies quite near 
these edges of the bands, the reason for this being, as discussed in more 
detail in connection with the Fermi-Dirac statistics, that thermal energies 
can excite electrons by only about 0.03 electron volt of energy, whereas the 
band may be several electron volts wide. Hence the states of interest are 
within a few per cent of the edge of the energy band. 

In order to deal with electrons and holes, we must consider both the 
bottom of the conduction band and the top of the valence-bond band. 
Furthermore, for each of these, the energy of interest may occur at either 
the center or the corner of the Brillouin zone. Fortunately, all of these 
cases may be treated on an equivalent basis, as we shall show. We shall, 
therefore, treat first the simplest case and later show the relationship of the 
.other cases to it. Accordingly, we suppose that the bottom of the conduc- 
tion band comes at an energy S at the center of the zone; this is the situa- 
tion shown in Figures 6.5 and in 5.11 if the highest energy is at the corners. 
General theoretical arguments show that the spherical surfaces must have 
.the equation 

5 = So + K{P 2 + P 2 + P 2 ) = $o + KP 2 (1) 

1 See, for example, F. Seitz, Phys. Rev. 73, 549-564 (1948). I am informed by C. Herring, 
however, that there are some strong theoretical reasons for believing that one of the bands is 
degenerate at its limiting energy with consequent complications in the shape of the energy 
surfaces, We stall return to this question in Chapter 12, 
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where P is the magnitude of P. This gives rise to a velocity 

v x - d&/dP x = 2KP X 
v y = d&/dP y = 2KP„ 
v, = aS/aP, = 2KP, 



or in vector notation, 



y = 2/CP. 
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(2a) 
(2b) 
(2c) 

(3) 



If a force F is applied to the particle, the velocity changes according to the 
law 

i, = 2KP = 2KF. (4) 



This is analogous to Newton's second law in the form 
F = m n a or v = f/»» n , 



(5) 



if we interpret 1/2K as an effective mass m n for an excess electron. In 
other words, the application of a force changes the velocity of the electron 
just as if it had a mass of 1/2K. 

The quantity 1/2K for states in the conduction band (or 1/2K for the 
valence-bond band) is of the same order of magnitude as the mass of the 
free electron, a result which we shall derive by general considerations of the 
wave functions in Section 14.7. From the experimental data on silicon 
and germanium it appears to be quite near the electron mass. For most 
of the purposes of this chapter not enough use is made of the difference 
between the effective mass and the mass of the free electron to warrant 
emphasizing the effective mass at this point in the treatment by giving it a 
special symbol. We shall, therefore, write l/2m for K and proceed 
accordingly. 2 

On the basis of this assumption, we see that the response of an electron 
at the minimum energy point of a Brillouin zone to a force is the same as 
for a free electron with 



i = F/m. 



(6) 



We shall use this relationship later in connection with conductivity. 

What happens if the lowest energy of the zone comes at the corner of the 
zone? To illustrate this it will suffice to consider a two-dimensional 
example. This is shown in Figure 7.4, where, for emphasis, the higher 
energy contours have been omitted. In (a) the effect of a force in accel- 
erating an electron from P to P 5 is indicated. Each time the momentum 
reaches a point on the zone boundary, the wave function becomes equiva- 
lent to a point on the opposite boundary, as shown. (It may be noted in 

2 A wave-packet electron made from quantum stages where the effective mass is very small 
can be accelerated more readily in a crystal than it can be in vacuum. The way in which 
interaction with the crystals helps to accelerate the electron is discussed in Section 14.7. 
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this case that when the point reaches a boundary, v is parallel to the bound- 
ary so that, as previously discussed, the wave function carries no current 
in a direction perpendicular to the boundary.) If the zone is rearranged 
for convenience as shown in part (b), the change in momentum fromP to 
P 6 becomes a continuous straight line. If we now describe the momentum 
of the electron by the vector P' which is drawn from the center of (b), we 
once more have the relationship that 

& = So + K'(P X ' 2 + P v ' 2 + P.' 2 ) = So + K'P' 2 



(7) 




(a) THE EFFECT OF A FORCE AC- 
CELERATING AN ELECTRON 
FROM P TO P 5 



(b) REGROUPING OF QUADRANTS 
(ENLARGED) TO SHOW CONTI- 
NUITY OF MOMENTUM AND 
VELOCITY CHANGE 



FlG 7-4_Rearrangement of Brillouin Zone for the Case Where the Lowest Energy 

Comes at the Corners. 

as a consequence of general theoretical arguments about the energy sur- 
faces near the corners of the zone. Since the behavior of P' under a force 
in (b) is just the same as that of? itself at the center of the zone in the case 
previously considered, it is evident that similar results will be obtained here 
and 1/2K' can be regarded as an effective mass. As Figure 7.4 illustrates, 
the velocity of the electron varies continuously with P' and shows no dis- 
continuous behavior at the edge of the Brillouin zone. 

Thus the effect of a force on the electron is to change its crystal momen- 
tum according to the equation 

P or P' = F (8) 

and, since its velocity is related to P or P' by the equation 

v = 2KP = P/m 
v = 2K'P' = P'/m, (9) 



or 
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v = F/m or F = ma 
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(10) 



(") 



where a = i> is the acceleration. 

In terms of P, P' and v the energy can be written as 

S = So + (P 2 or P' 2 )/2w = S + «y 2 /2. 

Thus the energy has the form of a constant plus a term which is formally 
identical with a kinetic energy associated with velocity v. Actually, the 
term mv 2 /2 represents a combination of changing potential energy of the 
electron in the periodic potential and changing kinetic energy as the Bloch 
wave function for the electron changes with changing P. Thus although 
the expression for the energy may be used as if it. arose from changing 
kinetic energy of motion of the excess electron, this feature should be con- 
sidered a mathematical simplification rather than a statement of a basic 
physical fact. As will be seen in Section 7.6, a similar simplification may 
be made in the case of a hole. 

We next must verify that under the influence of electric and magnetic 
fields, the Brillouin zone electron behaves like a classical one so far as its 
acceleration is concerned. This result follows directly from the F = ma 
together with equation for the force, which as we discussed in Section 7.1, is 
taken as 



-e(E + v X H/c) = F = mi = ma. 



(12) 



Here e is a positive number so that — e is the charge on the electron. This 
equation states that the rate of change of v in electric and magnetic fields is 
related to£, H, and v in just the same way as it is for a free classical electron. 
This result seems so obvious that it may be worth while to review briefly 
the route taken in reaching it. When we treat the case of the hole in the 
next few pages, we shall use the same machinery but will conclude that a 
plus sign should be used in the equation expressing a in terms of E, H, and v. 
In essence, we have indicated that it is a consequence of quantum theory 
that an electron may have a state of motion through a periodic potential 
field which gives it a crystal momentum P and velocity v which remain 
constant in time if no external forces act (the crystal being assumed perfect). 
This in itself is a surprising result; a classical electron moving in the 
crystal field would be deflected so often that its vector velocity would take 
on practically random values and would, on the average, be zero. It is, 
thus, the Schroedinger wave equation which leads us to the idea that the 
electron moving in the crystal can have values for crystal momentum and 
average velocity which are independent of time. Next, we find that from 
the same quantum mechanical framework we can deduce that P = F. 
Finally, as an approximation, we conclude that near the minimum energy 
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point of the zone v = 2KP, and as a further approximation, that v = P/m. 3 
(In later chapters we shall introduce m n and m p for effective masses and in 
Chapter 12 we shall consider a more complicated possibility.) Finally we 
equate F to the classical expression for force on an electron of charge —e 
and velocity v moving in a field E and H ; this last step represents in a 
sense an additional assumption in the exposition as presented here. Ac- 
tually this expression for the force is contained in Schroedinger's equation, 
and thus does not really represent a new assumption. 

Thus we reach the end of our treatment of the behavior of an excess elec- 
tron in the conduction band. The Brillouin zone theory indicates that it 
behaves in the same way as a free classical electron, provided its energy is 
always so low that it stays near the bottom of the conduction band. In the 
next paragraphs we shall verify that the behavior of a hole in the valence- 
bond band is similarly like that of a positively charged electron. 

7.6 THE BEHAVIOR OF A HOLE 

As we shall show, the hole is really an abstraction which gives a con- 
venient way of describing the behavior of the electrons. An essential 
feature in making this abstraction is the fact that a full Brillouin zone 
with every allowed state occupied can carry no net current. This feature 
permits the behavior of the hole to be found directly from the behavior 
which the missing electron would have if it were present. 1 

We indicated at the end of Chapter 6 that the average vector velocity for 
all the electrons in a Brillouin zone was zero because their individual 
velocities cancelled off in pairs. However, this conclusion was based on 
symmetry arguments and followed a discussion of the basic P-lattice of 
the Brillouin zone. (Figure 5.10.) After that discussion, we permitted the 
array of allowed points to be translated through the Brillouin zone keeping 
the density constant. This does not invalidate the conclusion that the 
full zone can carry no current, for when the velocities are added up for all 
the electrons, they are found to cancel so accurately that what is left over 
is far less than that of one electron ; in fact, the bigger the crystal and the 
more electrons in the zone, the more perfect is the cancellation. Thus the 
cancellation failure is negligible compared to the effect produced by re- 
moving an electron so as to leave a hole in the band. This is the key to 
the procedure for computing the effect of the hole. 

We reason as follows: Consider first a full Brillouin zone. Each elec- 
tron, as discussed in Section 6.1, contributes a current density { — e/V)v to 
the crystal. We single out a particular quantum state for consideration; 

3 When we apply this theory to practical cases, the unlimited variations of P produced by 
an electric field are reduced by the effect of collisions so that only small P values need be 
considered and the approximation applies. 

1 See Sections 1S.7 and 15.8 for an analytical treatment of the same subject matter. 
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call it state J, with velocity v,. We now add up all the currents due to all 
the electrons, but we split the sum into two parts with the electron in state 
s separated from the others. Since the zone is full, this sum is zero, and 
we have 



i^s 



from which we obtain another equation: 

Z {-e/V)Vi = {e/V)v>. 



(2) 



Now the left side of this equation is simply the net current due to a Bril- 
louin zone with every state occupied except s. Hence we see that this 
current is just what we would get if we had an empty zone with one posi- 
tively charged electron moving with v a . This is a lemma in the proof that 
holes act like positively charged electrons. Stated in words: 

The current due to a hole corresponds to a charge +e moving with the velocity 
associated with the vacant quantum state: 

I (hole) = (+e/F)v„ (3) 

Our next problem is to find how the current in the zone varies under 
applied electric and magnetic fields. For this purpose we must consider 
the dependence of energy upon momentum near the top of the energy band. 
As we did for the case of the bottom of the energy band, we disregard the 
possibility that this upper energy may be degenerate and conclude that 
the surfaces must be spherical and that the energy must vary as P 2 ; the 
problem of rearranging the zone if the highest energy occurs at the corners 
is handled in the same way; and we also use the electron mass m for 1/2K. 
However, there is a vital difference from the case of the bottom of the 
conduction band, since now we must have an energy which decreases as 
P or P' varies from the maximum energy point. Consequently, we have 

g = 6 - K{P X 2 + Py 2 + P.*) 

= & - P 2 /2m p (4) 

where m v is the effective mass for a hole. As for m m of (5) in Section 7.5, 
we shall not distinguish between m v and m in this chapter. From this we 
conclude that the relationship between velocity and momentum is 

v = -P/m, (5) 

and the relationship between velocity and energy is 

6 = S - mv 2 /2. (6) 

Hence if we had an isolated electron in this part of the Brillouin zone, it 
would behave in a very anomalous way, as discussed in connection with 
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the periodic motion in Section 7.3. For example, if the isolated electron 
were at the top of the band, the application of an electric field would 
decrease its energy rather than increase it. However, we do not have an 
isolated electron but instead a vacant quantum state at the top of an 
otherwise filled band. This situation obviously represents a minimum 
energy condition for the band as a whole since the electrons are all in lower 
states of energy than the vacant state. When the hole acquires velocity, 
it does so at the expense of displacing an electron from a lower state into the 
higher state. We must now see that the resulting changes in energy and 
current are those to be expected for a charge of +e and a mass of +m. 

We shall, for purposes of exposition, take a preliminary step in the proof 
by showing the behavior of a hole initially at rest corresponds to a positive 
and not a negative mass. Accordingly, we suppose that the vacant quan- 
tum state s is at the highest energy of the valence-bond band so that 
S = So and v a = 0. The application of an electric field E causes the 
quantum states to change according to the law P = —eE. Hence, the 
velocity of the vacant state, which bears a negative mass relationship to 
the momentum, will vary according to the equation 

v s = -P/m = +eE/m, ; (7) 

and the current will vary according to 

/ = ( + e'F,v a = (+<?/m)E. (8) 

This relationship shows that the current increases in the direction of the 
applied field. If V a = at I = 0, then the power delivered to the specimen 
would be given by the equation 

E ■ I = (<?/m)E 2 t (9) 

so that energy would be absorbed in accelerating the hole. Now suppose 
we try to interpret the behavior of the hole as corresponding to a charge e v 
and a mass m p . Then we would conclude that 

v = e p E/m p (10) 

and 

EI = (e 1 , 2 /m p )E 2 t. (11) 

It is evident that the only choice for e p and m v which will cause these two 
equations to give the same results as (7) and (9) is e p = e and m v = m. 
The choice m p = —m\s excluded by the sign of the power input equation — 
attributing a negative mass to the hole would cause it to deliver energy to 
the field rather than to absorb it. 

We shall next derive a more general expression which includes the effects 
of magnetic fields as well. For this case 

P = F = -e(E + v,X H/e) (12) 
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and mv, = -Pas before. From these equations we obtain 

mv a = +e(E + v,X H/c). (13) 

This is just the equation for F = ma for a positive charge. Furthermore, 
we have shown that the current density is given, as in equation (3), by 

/ = {elV)v s . (14) 

From these two equations we may obtain an equation relating 1 to E, H, 
and 7: 

/ = (e 2 /m)\(E/F) + (J X H/ec)] (15) 

for a hole in the valence-bond band where e is the positive magnitude of 
electronic charge. . 

A similar treatment for the case of an electron at the bottom of the 
conduction band, using equation (12) of Section 7.5 instead of (13) of 
Section 7.6 and {-e/V)v instead of (+e/F)v s , leads to 

/ = (e 2 /m)[(E/n - (' X H/ec)} (16) 

where again e is the positive magnitude of the electronic charge so that the 
charge on the electron is — e. 

It is evident that these equations differ in just the way expected for a 
change in sign of e. An interpretation involving a negative mass would 
lead to a current which was accelerated opposite toE for the case of H = 
and is thus inadmissible. (See Problem 7.) Equations (15) and (16) 
are in a form readily adapted to treating conductivity and Hall effect and 
are employed for the latter purpose in Section 8.7. 

The equation of conservation of energy may be established for the case 
of a hole in the valence-bond band by addition for all the electrons in the 
zone. Since, for each of them, dS/dt = F ■ v, the rate of change of the 
total energy in the band must be equal to the rate at which power is being 
furnished by the electric field. 

It is more illuminating, however, to interpret the energy changes as due 
to a "pseudo kinetic energy of motion of the hole". This idea is approached 
by calculating how the energy of the electrons depends on the energy of the 
empty quantum state. If the hole changes its velocity from v. = to 
v, = Vi, it is evident that the redistribution of the electrons has a net 
result of shifting one electron from energy 

S - mv t 2 /2 to So ( 17 ) 

or an increase in energy of mv^/2. Of course, the redistribution takes 
place by the flow of the quantum states 2 according to P = F, so that the 
2 It is evident from the discussion of Section 7.2 that the quantum states should be con- 
sidered to flow, whether occupied by an electron or not. 
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electron formerly at v t is not the one raised to v = 0. However, the net 
increase in energy for all the electrons is this amount. In other words, the 
energy gained by the electrons is equal to the kinetic energy calculated for 
a hole of mass +m moving with velocity v±. 

Finally, it may be added that the charge density in the crystal due to one 
hole in the valence-bond energy band may be determined from the wave 
functions occupied by the electrons. When this is done, it is found that 
■ the hole contributes a deficit electron density. Furthermore, the wave 
functions may be combined to produce a wave-packet for the hole and this 
wave-packet is found to have the group velocity v a corresponding to the 
velocity of the vacant quantum states combined in making the wave- 
packet (Section 15.8). Wave-packets for the positive charge of holes may 
thus be dealt with in the same way as the wave-packets for electrons. It 
is such wave-packets for holes that should be used to describe the flow of 
holes in »-type germanium in transistors or the Haynes-Shockley experi- 
ment of Section 3.1. 

7.7 SUMMARY 

Starting with the law derived from quantum mechanics that P = F and 
certain approximations about the behavior of the energy surfaces in a 
Brillouin zone, it has been shown that the behaviors of one electron in the 
conduction band and of one hole in the valence-bond band are given by the 
classical equations for an electron with minus charge and plus charge, 
respectively. The argument has been long, and the behavior of the hole 
has been shown to be essentially a shorthand way of describing the behavior 
of all the electrons. 

Some of the essential conclusions about the behavior of the electron and 
the hole are shown in Figure 7.5. Part (a) represents the Brillouin zone 
for the conduction band, a small portion near the minimum energy having 
been singled out for attention. In this region, the velocity and the accel- 
eration of an excess electron by a magnetic field will be as shown near the 
center portion of Figure 7.3. (It may have been necessary to rearrange the 
zone and use P' in Figure 7.5.) Part (b) similarly represents the highest 
energy in the valence-bond zone, in which the velocity and the acceleration 
in a magnetic field of a hole will be as shown near the corners of Figure 7.3. 
If an electric field E x alone were present, all the points in the zone would 
move to the left in accordance with the equation P x = —eE x , and the 
velocity of the electron and hole would change as v (electron) = P/m, 
V (hole) = -P/m. These differences are indicated in the four parts of 
Figure 7.5 for an electron and a hole, each initially in the state with v = 0. 
The energy for the quantum state occupied by the electron would increase 
with time in (a) and (c) and that of the state occupied by the hole (that is, 
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CONSTANT ENERGY CONTOURS- 
EQUAL ENERGY DIFFERENCES 
' 6=6 + P z /2m 




(3) CHANGE IN QUANTUM STATE OF 
AN ELECTRON INITIALLY AT THE 
STATE OF LOWEST ENERGY 



CONSTANT ENERGY CONTOURS- 
EQUAL ENERGY DIFFERENCES 
- 7 5=S -p2/2m 




(tO CHANGE IN QUANTUM STATE OF 
AN ELECTRON INITIALLY AT THE 
STATE OF HIGHEST ENERGY 



_ CONSTANT ENERGY CONTOURS- 
C EQUAL ENERGY DIFFERENCES 




20 am 



(C) CHANGE IN VELOCITY ANO ENERGY 
OF ONE ELECTRON IN THE CON- 
DUCTION BAND 



_ CONSTANT ENERGY CONTOURS- 

E EQUAL ENERGY DIFFERENCES 

•0,- „ v ^ jTs'=EI + mu2/j 
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(d) 



20 am 



CHANGE IN VELOCITY FOR HOLE 
IN VALENCE-BOND BAND AND 
CHANGE IN ENERGY, 6', OF ALL 
ELECTRONS IN BAND 



Fig. 7-5 — Summary of the Behavior of an Excess Electron or a Hole under Influence 
of Combined Electric and Magnetic Fields. Dashed Arrows Show Variation with 

Time. 



left empty) would decrease with time in (b) ; the total, however, of all the 
electrons in the valence-bond band would increase as shown in (d). 

If a magnetic field alone were applied, all of the quantum states would 
simply rotate as a rigid body about the Praxis, the angular velocity being 
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(as discussed in the next chapter; also see footnote below) ±eH/mc, the 
directions of rotation being those shown in Figure /.3. 

If both E x and H. are present, the dependence of the quantum states 
upon time will be as shown 1 : The momenta and velocities of the quantum 
states involved move on circles with constant angular velocity, lhe two 
fields result in a bodily rotation of all points in P-space (not merely the one 
initially at V = 0) about the point in the diagrams-a displacement 
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which obviously conserves the density of quantum states in the Brillouin 

Z ° A conventional scheme for representing the motions and energies of holes 
and electrons is shown in Figure 7.6. Here we consider the production of a 
hole-electron pair by a photon as discussed in Chapter 1 The photon 
excites an electron from a state in the lower band to one in the higher band 
h general, neither of the states is at the edge of the forbidden band, so that 
both the hole and the electron have velocities of motion and contribute their 
parent kinetic energies to the energy of the crystal. If the location of 
the exciting photon has been closely controlled (by focusing the light with 
a lens, for example), then the wave functions for the hole and the electron 
will be represented by localized wave-packets. The effect of the electnc 
i This interesting case is not essential to the later development-the fornmlae of the Hall 
effecting obtained in a simpler way. It is instructive, however, to work it out » detad. 
SesSn d be conipactl/obtained in con^s notation as follow, Lett,ng .^H, 
the equation m» = ±e{E x - HH./c) is solved by • - -uEJH. + const exp C±«'«'/»<> 
lothat O corresponds to „, = -cEJH,. This can easdy be mtegrated to g ,ve «,«. 
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field is to tilt the energy bands since an electron in a wave-packet state at 
the bottom of the conduction band will have a lower energy at the positive 
side of the crystal. In the absence of an electric field, the lowest energy 
State for the excited pair is that for which the electron is at the bottom and 
the hole at the top of their respective bands. The general rule that holes 
tend towards high levels on energy diagrams is, of course, just the same as 
saying that electrons tend towards low levels. The effect of the electric 
field will be to accelerate the wave-packets in the indicated directions. We 
shall return to a more detailed treatment of such energy level diagrams in 
Chapter 12 where they are used in analyzing the rectification of p-n junc- 
tions and related topics. 

In order to proceed further to the discussion of conductivity and Hall 
effect we must consider the effect of random processes upon large numbers 
of holes and electrons behaving as shown in Figure 7.5. This we do in the 
following chapter. 

7.8 CONNECTION BETWEEN QUANTUM AND CLASSICAL 

MECHANICS 

From the material presented in this chapter, it can be suggested how the 
results of quantum mechanics may merge with those of classical mechanics 
in the limiting case. As we have discussed, it is possible to build up wave- 
packets which move with certain group velocities. If the electron is not 
in a crystal (so that the periodic potential energy is eliminated), then 
quantum mechanics leads to just the Same formula for energy, as a function 
of momentum, as does the classical theory for an electron. Consequently, 
the group velocity, momentum, and energy are related as in the case of a 
free particle. Furthermore, in electric and magnetic fields, the electron is 
subjected to a force which changes its momentum as if it were a classical 
particle. As a result, the wave-packet will behave in just the same way 
as a free electron. 

Analysis along the lines presented above can be carried out in detail. 
When this is done, it is concluded that so long as the dimensions of the 
environment in which the electron moves are very large compared to its 
-wave length, the wave-packet may be regarded as traveling like a classical 
free particle. The requirement that the dimensions be large is not very 
stringent. For example, wave-packets for electrons in the smallest of 
vacuum tubes would be a thousand times smaller than the grid wire 
spacing and thus could be dealt with as particles. 

In this sense, classical mechanics may be regarded as a limiting case of 
quantum mechanics just as geometrical optics is a limiting case of physical 
optics, which deals with diffraction phenomena for light. In both cases, 
when the size of the structures involved is very large compared to the wave 
length, the appropriate limiting methods may be employed. The same 
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differences in treatment are used in connection with both radio waves and 
microwaves. For antenna design and small-scale effects, diffraction phe- 
nomena are vital; for large-distance effects such as ionosphere reflections 
and the target end of radar systems, geometrical optics suffices. 

There is, of course, far more to the philosophy of quantum mechanics as 
applied to large-scale phenomena than is involved in showing that the 
electron wave-packet behaves like a classical electron. However, a treat- 
ment of these important questions belongs not in a treatise on a limited 
phase of the role of electrons in semiconductors, but in texts on quantum 
mechanics and the philosophy of physics to which the interested reader is 
here referred. 1 

Problems 

1. Verify, by considerations of symmetry, from Figure 7.3 that, if an 
electric field E x = E y is applied, the electron undergoes periodic motions 
in the x and y directions in space with the same period, so that it periodically 
retraces the same path. What can be said of the motion if E x = 2£„? 

2. By the method of the footnote of Section 7.7, find the trajectories in 
fields E x and H z for holes and electrons with initial velocities v x and v y . 

3. Prove that, if the initial velocities at/ = are v x = v cos 6, v y = v sin 0, 
the displacement at time t when averaged over 6 is independent of v . 
(This shows that the effect of random velocities in Section 8.6 can be 
disregarded.) 

4. For some crystals, such as bismuth and zinc, the important parts of 
the Brillouin zones have energies of the form 

& = S + K X P X 2 + KyPy 2 + K Z Pf. 

Find the equation of motion which replaces ma = F for this case and 
integrate these equations for F due to H z . 

5. What happens to the mass of the crystal as a whole when the hole and 
electron of Figure 7.6 recombine? 

6. Verify by using in part equations (15) and (16) of Section 7.6 that 

the power expended by the electric field E( = J I- EdV\ can be accounted 

for by the changing energy of the electron or hole. 

7. Show that a particle of negative mass initially at rest so that / = at 
/ = produces a negative resistance and delivers during time t an energy 

«W/2|»|. 

Show that a particle of positive mass absorbs an amount of energy given 
by the same formula. 

1 P. W. Bridgman, The Logic of Modern Physics, Macmillan, New York, 1927. R. B. 
Lindsay and H. Margenau, Foundations of Physics, John Wiley & Sons, New York, 1936. 



CHAPTER 8 

INTRODUCTORY THEORY OF CONDUCTIVITY 
AND HALL EFFECT 

8.1 INTRODUCTION 

In this chapter the laws of motion for electrons and holes derived in 
Chapter 7 are applied to the analysis of conductivity and Hall effect. The 
exposition presented here is based on a simplified treatment of the "mean 
free time". In Chapter 11, a more detailed discussion of the mean free 
time is presented. The purpose of presenting the simplified treatment at 
this point in the text is to illustrate the connection between experimental 
results and the abstract matters discussed in Chapters 5, 6, and 7 before 
proceeding with the theoretical topics of the next three chapters, which 
treat energy level diagrams, the Fermi-Dirac Statistics, and collision 
processes. In addition, the simplified treatment, in anticipating the results 
to be derived later, serves as an introduction to the following chapters. 

In this chapter, we shall introduce, in addition to the mean free time, the 
concepts of an equilibrium thermal distribution of electrons and holes and 
of random processes which bring this equilibrium about. However, these 
latter two ideas will enter the equations only through the mean free time, 
denoted by t. 

There are two important ways of thinking about the current density in a 
semiconductor. Although mathematically equivalent for most purposes, 
the mental images associated with these are somewhat different and for 
this reason both methods are described. According to one method, the 
electrons or holes are thought to be in definite quantum states in the 
Brillouin zone and thus to be carrying currents of (jFe/V)v in accordance 
with equations (1) of Section 6.1 and (3) of Section 7.6. This method is 
well adapted to calculating currents on the basis of diagrams showing how 
applied electric and magnetic fields disturb the distribution of electrons in 
the momentum space of the Brillouin zone. According to the other 
method, the holes and electrons are treated as classical particles having at 
any instant definite locations and velocities. This treatment is justified by 
the analysis presented in Chapters 6 and 7 which shows that wave-packets 
for holes and electrons behave like classical particles having effective 
masses which may differ from the free electron mass. According to this 
particle treatment, the electrons and holes move in the presence of electric 
and magnetic fields in curved paths (like the parabola for a falling body) 
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between collisions. (In Chapter 11, a more general method of attack is 
used which is applicable to the case in which the effective mass concept 
must be modified.) 

There are cases, however, in which the wave-packet method is appro- 
priate to the physical situation whereas the method of using individual 
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states in the Brillouin zone is not. The latter is adapted to the case in 
which the wave function gives equal probability that the electron (or hole) 
is in anyunit cell of the crystal. In the experimentof Haynes and Shockley, 
Section 3.1, on the other hand, the holes are injected at a definite point in 
the germanium and when first injected have negligible probability of being 
anywhere except near that point. After a measurable lapse of time, the 
probability that there are holes at the collector point first rises appreciably 
above zero. The appropriate description, in this case, is that of wave- 
packets or particles, and one may think of the holes as being injected and 
pulled along the germanium as described in Chapter 3. The justification 



8.2) 



RANDOM PROCESSES AND TRANSITION PROBABILITIES 



189 



for this procedure is the mathematical one that wave-packets of electrons 
or holes have the behavior of classical particles. 

Figure 8.1 has been prepared to indicate the logical connection among 
the various ideas discussed in this chapter. The section numbers given in 
the figure indicate where the topics are discussed. This figure will probably 
be of most value after Sections 8.2 to 8.7 have been read. 

8.2 RANDOM PROCESSES AND TRANSITION PROBABILITIES 

As stated in the introduction to this chapter, the theory of the mean free 
time is based on the idea of random processes. As its name suggests, the 
mean free time, which has already been referred to several times without 
being defined, is related to the time during which an electron moves without 
being affected by a random process. We shall consider first the nature of 
random processes, and then show how these processes can lead to an 
equilibrium distribution. In the following section we shall define the mean 
free time and relate it to these random processes. 

The random processes referred to occur because the crystal is imperfect. 
If the crystal were ideal so that the potential field in which the electron 
moves were perfectly periodic, then the electron would remain in one quan- 
tum state indefinitely; or, if acted upon by electric and magnetic fields, it 
would change its momentum according to the law P = F. In a real crystal, 
however, the potential field is not perfectly periodic and, as indicated in 
Figure 8.1, there are three main classes of imperfections. The heat sto.red 
in the crystal is present in the form of vibrations of the atoms. If tempera- 
ture vibrations are lacking, imperfection may be present in the form of 
impurity atoms, such as donors or acceptors as discussed in Chapter 1, or in 
the form of places where atoms are missing from their normal positions and 
are perhaps squeezed into places in the crystal where they do not belong. 
(Additional imperfections, such as "dislocations", may be present also.) 

A description of the motion of electrons in imperfect crystals is based on 
the quantum-mechanical idea of transition probabilities. When the crystal 
is imperfect, the electron does not stay in one state of the Brillouin zone 
indefinitely; instead, after a time, it will make a quantum jump to another 
state in the Brillouin zone. This process is illustrated in Figure 8.2(a) 
which shows the allowed states near the lowest energy of a Brillouin zone, 
with an excess electron occupying one of them. As a consequence of the 
imperfection of the periodic field, there is a transition probability that the 
electron will jump from the state which it occupies to any one of a number 
of other states. This probability depends in a complicated way upon the 
values of P for the initial and final states and is not restricted to such a small 
group of states as that shown in Figure 8.2(a). 1 Figure 8.2(b) shows a 
corresponding picture for a hole near the top of the valence-bond band. 

1 A discussion of the transitions is given in more detail in Chapter 11. 
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The transition of the hole comes about as a consequence of an electron 
jumping into it as indicated by the arrows. In this case the Pauli exclusion 
principle exerts an influence on the transitions, since an electron can make 
only those jumps which carry it to a state not already occupied by another 
electron. This means that the only jumps possible for electrons are those 
which end in the hole; and whenever such a jump occurs, the result is 
equivalent to a jump for the hole. Although the processes illustrated in 
(a) and (b) have this great difference, the net result is that the transition 
probability for the hole is approximately the same as that for an electron. 
This may be seen as follows: 
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Fig. 8-2 — Transitions for an Electron and for a Hole. 



In Figure 8.2(a) the one electron can make jumps to any one of, say, 
A states. In Figure 8.2(b) each electron can make a jump to only one 
state; however, there are now ^electrons which may jump. Furthermore, 
the probability of a jump to a vacant state is about the same for an electron 
near the top as for one near the bottom of a band. (Discussed in Chap- 
ter 11.) As a consequence, the probability that the hole in (b) makes a 
jump corresponds to A electrons each having one possible transition whereas 
the probability that the electron in (a) makes a jump corresponds to one 
electron with A possible transitions. These two over-all transition prob- 
abilities are thus comparable, a result which, as we shall later show more 
fully, accounts for the fact that holes and electrons have approximately 
the same mobility. 2 

2 In Chapter 1 we discussed the similar dynamics of holes and electrons. We here point 
out the similarity of the effects of random processes upon them. In Chapter 10 we shall 
show that their thermal equilibrium statistics are equivalent. 
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For the purposes of this chapter, we shall suppose that only a small 
fraction of the states in the conduction band are occupied by electrons, so 
that we may disregard the possibility that a transition is prevented because 
the end state is occupied. A similar assumption is made for holes. Ac- 
tually the removal of these assumptions is readily accomplished in the 
analytical theory; it would add unnecessary complications, however, to 
treat the more general case in this chapter. 

If the jump which the electron makes is due to thermal vibrations, it will 
(according to the theory of transition probabilities) gain or lose energy each 
time it jumps. In this way, it can exchange thermal energy with the 
vibrating atoms of the crystal and come to equilibrium. It is, of course, 
meaningless to speak of one electron being at thermal equilibrium since 
thermal equilibrium is a statistical situation in which some electrons have 
large energies and some have small energies. However, if a large number of 
electrons are present in a conduction band, their equilibrium distribution 
is a definite thing which we shall discuss in the next two chapters. It is 
worth mentioning that the thermal equilibrium distribution of electrons in_ 
a crystal, or of atoms in gas or any similar example, is independent of the 
exact nature of the transition probabilities which permit it to arise. In 
other words, no matter what the nature of the transitions is, so long as they 
provide a means of exchanging energy, the system will eventually reach one 
and the same end state of thermal equilibrium. The rate, however, at 
which the end state is approached depends in detail upon the nature of the 
transitions — an idea which brings us to the consideration of the mean free 
time. 

8.3 THE MEAN FREE TIME 

As stated before, the theory of transitions or collisions presented in this 
chapter is based on certain simplifying assumptions. In particular we 
assume: 

(1) The probability that the electron {or hole) makes a transition in any 
small interval dt of time is dt/r where t is a constant. 

This assumption has two independent implications: (a) The probability 
of transition does not depend on the elapsed time since the last transition, 
(b) The probability of transition does not depend upon the quantum state 
occupied by the electron or hole. Both of these implications come from the 
statement that the "probability parameter t", which has the dimensions 
of time (seconds), is a constant so that it does not depend on the factors 
mentioned in (a) or (b). By "a constant" we mean, of course, constant 
under given conditions of temperature and composition. Also t will not 
be the same for electrons as for holes. 

As we shall show t is the mean free time; the natural definition, how- 
ever, for the mean free time is stated in words very different from those 
employed in (1) ; for this pedagogical reason, we shall not call T the mean 
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free time until, with the aid of equation (1) of Section 8.3, we have defined 
the mean free time, denoted by i, and proved it equal to T in equation (5) 
of Section 8.3. 

The second assumption is: 

(2) The end state of the transition is independent of the initial state, and the 
probability of arriving at any particular end state is proportional to the prob- 
ability that the end state would normally be occupied in the thermal equilibrium 
distribution. 

What assumption (2) means is that no matter how the electrons are dis- 
tributed among the quantum states at t = 0, as soon as each one has made 
a transition, they will be distributed in the thermal equilibrium distribution. 
(In other words, they cannot remember what they were doing before the 
collisidn.) 

Assumptions (1) and (2), together with the equations of Chapter 7, 
furnish the basis from which we derive the equations for conductivity and 
Hall effect. These final equations correspond to a physical picture, which 
we shall now describe, inaccurately, in order to suggest the significance of 
(1) and (2) in the later development. According to (1), the electrons are 
accelerated by the applied fields for approximately a time t; then they 
collide. If these collisions produced only small deflections of the velocity 
of the particles, then the condition after collision would be influenced by 
the field acting before collision. According to (2), however, the collision 
wipes out all memory of the pre-collision state of the particle. Thus the 
behavior of the particles under the influence of fields is substantially that 
which would occur t seconds after an instantaneous application of the 
fields. 1 

Before defining the mean free time and proving its equality to the 
probability parameter t in assumption (1), we shall attempt to make (1) 
appear to be reasonable by giving a discussion of collision processes on the 
basis of a particle model. According to this model, the electron moves as 
a particle in a straight line path (or a curved path if electric and magnetic 
fields are present) for a length of time and is then abruptly deflected, after 
which the process repeats. These abrupt deflections are analogous to 
collisions between molecules in a gas and may also be referred to as colli- 
sions. In particle language, assumption (1) is equivalent to saying that 

1 As we shall discuss in Chapter 11, neither assumption (1) nor (2) is in good agreement 
with the best theories: the time between collisions depends in an important way on the speed 
of the electron (or hole) and, after a transition, energy and direction will not be, on the 
average, random as suggested by (2). If the time between collisions is properly averaged, 
however, by procedures discussed in Chapter 11 (taking into account the variation of time 
between collisions with velocity and the dependence of velocity after collision upon velocity 
before collision), then the so obtained average t may be inserted in assumption (1) and 
employed with assumption (2) and the analysis presented in this chapter will give the correct 
results: that is, the same as those obtained by the more involved procedures of Chapter 11. 
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the probability that the electron will suffer a collision is independent of how 
long it has traveled since its last previous collision. This is intuitively 
seen to be a reasonable result for the motion of molecules in a gas. Con- 
sider a molecule selected at random, and call it molecule (a) ; although 
molecule (a) has had a collision at some time in the past, the region of space 
into which it is moving contains molecules whose paths were unaffected by 
the previous collision of molecule (a). Thus the environment into which 
molecule (a) moves is unaffected by the length of time molecule (a) has 
been moving since its last collision, and, consequently, the probability that 
it surfers a collision in the next increment dt of time should be independent 
of how long it has been moving freely. 2 

The quantum mechanical theory of transition probabilities leads one to 
the same conclusion: the probability that a quantum jump may take place 
is also dtjt regardless of how long the electron has stayed in the quantum 
state. 

We are now in a position to define and discuss the mean free time. We 
shall do this first in the case of molecule (a) in a gas so as to have a specific 
picture in mind. We follow the course of molecule (a) over a long period 
of time T which starts and ends at instants of collision. During time X, 
molecule (a) has C collisions (not counting the starting one), the intervals 
between them being t\, (%,•••, tc. These intervals are called "free times", 
during which the molecule moves freely uninfluenced by random processes, 
and their average is the mean free time. This leads to the definition: 

The mean free time, l, is the (unweighted) average of the free times between 
collisions: 



t = 



h + h H V t c 



T/C. 



(1) 



[The word (unweighted) is inserted in the definition to suggest to the 
reader that some other averaging procedure might be employed. We meet 
such a case in Section 8.7; and a paradox encountered there is explained 
as being due to a difference in averaging procedures.] 

We must next study the distribution of the free times t\ to tc; as we 
shall see their average is t; however, all values from zero to many times t 
occur in the distribution. 

The analysis is most easily visualized by considering not C successive 
free times for one molecule, but instead one free time each for C molecules. 
By assumption (1), any free time, no matter how it originates, has the same 
statistical behavior so that the same distribution of free times will be 
obtained for either case. Suppose that by chance all C of these molecules 

2 This problem is closely analogous to the coin which is about to be tossed. No matter 
how many times it has come up heads in succession, the probability is always ^ that it will 
come up heads on the next throw* 
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have collisions simultaneously (in pairs) with other molecules at t = 0; 
we then follow each of these molecules until it has its next collision, record 
its free time in our list of t u h to tc ; thereafter, we may disregard its sub- 
sequent behavior since we have selected for study only the first free time 
for each molecule after its / = collision. The significant statistical 
features of this situation can be derived by consideration of the number of 
molecules, denoted by C(t), which have not collided up to time / after the 
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p IG _ 8-3— Dependence of the Number of Uncollided Molecules upon Time. 

/ = collision. How does C(t) vary with time? Figure 8.3 shows how 
the dependence is derived: at time / there are C(t) molecules which are 
uncollided; in an interval dt the probability that each of these will collide 
is dt/r. [Assumption (1).] Hence, the number colliding in dt will be 
simply C(t)dt/r, and during dt the number of uncollided molecules will 
decrease by this amount. Expressed as an equation 

dC(t) = -C(t)dt/r. (2) 

This is a differential equation for C(t) which can be integrated as follows: 

-dt/r = dC(t)/C(t) = din C(t) 

In C(t) = -t/T + a 

Cit) = e°e-" r - Ce~ ilT (3) 

the constant a being evaluated so that C(t) = C when / = 0. From this 
expression we can find the number of mean free paths which end in dt at /. 
As discussed this number is C(t)dt/r and is thus: 

number of molecules 

colliding 
between / and t + dt 

8 Or which is entirely equivalent, consider C collisions selected at random and measure time 
for each molecule from the instant of collision. Or consider one molecule and C of its free 
times measuring time for each one from the instant of the collision which initiates it. 



= Ce- tlT dt/T 



(4) 
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Hence, among all the C collisions considered, the above number have a 
free time of / and, there/ore, contribute t times Ce~ t!r dt/r, to the sum 
h + h + ■ ' * + tc- Adding this up over all collisions leads to a mean free 
rime given by 

h + ft + • • • + fc = J. f° 
C " CJo 



t = 



tc<r tlr dtjT = t, 



(5) 



(the integral 

f te- t,T dt/T = t f e-^QMdit/r) =t[ e~ x xdx = t (6) 

Jo i/0 Jo 

being readily integrated by parts or found in the tables). 

It is worth while to review briefly what the argument has been in arriving 
at equation (5). First assumption (1) was introduced: (For a model like 
a gas, arguments were presented to show that (1) was reasonable with the 
probability parameter t being constant and independent of the time after 
the previous collision.) It was not at that point stated that T was the 
mean free time. Second, a mean free time t was denned. Third, the dis- 
tribution of free times resulting from assumption (1) was evaluated and 
from this distribution the value of I was computed. It was then found 
that the value of I is t. 

The same mathematical results would follow from assumption (1) no 
matter what model was used. We could equally well have used electrons 
described either by the Brillouin zone scheme or the particle scheme. If 
we had started with C uncollided electrons at / = 0, we would have found 
formulae (2) to (5) from just the same analysis. The same results would 
apply for holes. 

So far in this discussion, no use has been made of assumption (2). The 
results in the next section, however, will depend upon it. As stated earlier, 
detailed consideration of the processes leading to collisions will be discussed 
in a later chapter. 

8.4 BRILLOUIN ZONE TREATMENT OF AVERAGE VELOCITY, 
MOBILITY, AND CONDUCTIVITY 

We must now apply the concept of the mean free time and the distribu- 
tion of free times given by equation (4) of Section 8.3 to a calculation of the 
current produced by an electric field. There are several ways of viewing 
this problem and, since all involve ideas which are useful in gaining insight 
into the conduction process, we shall compare them. 

First is what we may call the Brillouin zone or distribution-in-momentum 
method. This is the one which best adapts itself to detailed analytic cal- 
culation when variations of t with the initial state are taken into account. 
We shall, however, deal with it here in accordance with assumption (1) 
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that t is constant. Before an electric field is applied, the electrons occupy 
a set of quantum states near the lowest energy in a manner described in 
detail in Chapters 9 and 10. This situation is represented schematically 
in Figure 8.4(a) for electrons and 8.4(b) for holes. For the cases con- 
sidered here, only a few electrons or holes are imagined present so that they 
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p IG g_4 — Effect of Electric Field, E x , on the Distribution of Electrons and Holes in 

the Brillouin Zone. 



occupy only a very small fraction of the states, inside the shaded areas, and 
a negligible number have energies outside the shaded areas. Under 
thermal equilibrium, this distribution is symmetrical so that just as many 
electrons are moving to the right as to the left and the average values of 
P and v are zero. 

We shall now specifically consider the case of electrons assuming that 
there are w of them present in the crystal, whose volume V we shall later 
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take equal to unity so that n will be the density or concentration of elec- 
trons (that is, number per unit volume). We shall find it convenient to 
deal with the total crystal momentum of the n electrons which is 



i\ot = Pi +/** + ■••+ P». 



(1) 



The average momentum of an electron is evidently i\ot/» and its average 
velocity v is 

V = (vt + i/ 2 H + v n )/n = Ptot/nm (2) 

because of the relationship P — mv which holds near the bottom of the 
conduction band, as discussed in Section 7.5. The current density of each 
electron, as discussed in Section 6.1, is { — e/V)v so that the current due to 
the n electrons is 



/ = (-e/V)( Vl + v 2 + • • • + v n ) = {-en/F)d 
= {-ne)v (for V = 1). 



(3) 



Under the influence of an electric field, which we take to be in the Af-direc- 
tion with component E x , all the electrons change their momenta according 
to P = — eE. This shifts the distribution to the left as shown in Fig- 
ure 8.4(c). At the same time, collisions tend to restore the electrons to 
the random distribution shown in (a). The steady state is reached when 
these two tendencies cancel with a net shift of the distribution to the left. 
We shall, therefore, calculate the change in time dt produced in P% Q t due 
to the field and due to collisions and by equating these find an equation 
describing the steady state. We need, of course, consider only the ^-com- 
ponent. This gives 

dPxtot = —neE x dt (4) 

for the change due to the field. During the same time, a fraction dt/t of 
the electrons suffer a collision according to assumption (1), and return to 
the random state according to assumption (2). Furthermore, according 
to assumption (1), the electrons colliding are a random sample of all the 
electrons; they, therefore, eliminate just their proportional share of the 
total momentum and reduce P^tot by an amount 



dP x 



-Pxtotdt/r. 



(5) 



Equating the sum of these two changes to zero gives the steady state 
condition 



— Pxtotdt/T - neEJt = 
•Pxtot = —neE x -x 



(6) 
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from which we readily infer for the vector field E that 

•Ptot = —neEr (7a) 

P = Ptot/w = -eEr (7b) 

v = -(er/m)E (8a) 

/ = (-ne)V = (ne)(er/m)E, (8b) 

These equations evaluate the current in terms of the constants t and m 
which describe the behavior of the electron in the conduction band. As is 
indicated in Figure 8.4(c) they correspond to a displacement of the dis- 
tribution of electrons in the Brillouin zone as shown. 

Equations (8a) and (8b) can be re-expressed in terms of the conductivity 
and the mobility. By definition the mobility is the ratio of average velocity 
to electric field without regard to algebraic sign. Thus we have 

M« = er n /m„ and v n = -jtt„£ (9) 

where the subscript n shows that the mobility has been evaluated for 
negative carriers, that is excess electrons. The conductivity is by definition 
the ratio of / to E and is 

a = ne 2 T n /m„ = ney. n (10) 

and is thus the total charge density, again without regard to algebraic sign, 
times the mobility. 

Precisely similar relationships apply for holes. We shall use p, for 
positive carriers, to represent the concentration or density of holes and also 
as a subscript. The shift in momentum is again derived from the law 
P = F. The sumPtot off* over the unoccupied states, or holes, will change 
by —epEdt owing to the field and will change because of collisions by 
—Pt tdt/Tp. Hence the steady state will be given by a formula like 
equation (5) for electrons and, as the reader may verify, the relationships 
p = —m p v and / = {+e/V)v for holes (discussed in Section 7.6) lead to 

M?> = er p /m p v p = +n P E (11) 

a = peix p . (12) 

When both holes and electrons are present, as occurs in the intrinsic 
range, the total current density is the sum of the hole and electron currents 
and the conductivity is 

c = neiin + pey. p (13) 

where we have used n and p for densities as follows: 

n = density of negative carriers = electrons/cm 3 ; (14a) 

p = density of positive carriers = holes/cm 3 . (14b) 
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At this point in the exposition the reader may find it advantageous to 
consider Figure 8.1 and, with its aid, review the argument up to this point. 

The units used in this discussion are absolute electrostatic units. It 
may be instructive to convert them to practical units and to consider some 
typical values. 1 We shall use the subscript L (for laboratory) to denote 
practical units since p has already been required for holes. In terms of 
the absolute electrostatic units, the mobility has the dimensions of cm/sec 
per electrostatic volt/cm. Thus we have 

H = er/m cm 2 /e.s. volt sec. (15) 

For practical work fields are measured in volts/cm and since 1 e.s. volt = 
300 volts and the value of p in practical units is 2 

hl cm 2 /volt sec = /*(cm 2 /e.s. volt sec)(l e.s. volt/300 volts) 

= (ju/300)cm 2 /volt sec = *t/300?» cm 2 /volt sec. (16) 

For hl = 10 3 cm 2 /volt sec or p = 10 s • 300, the mean free time t is 
t = 10 3 • 300 • m/e = 3 X 10 5 X 9.1 X 10~ 28 /4.8 x 10" 10 

= 5.7 X 10- 13 sec. (17) 

This time is so short that relaxation effects could not be observed even with 
waves in the millimeter range. 

A typical value of conductivity (for germanium) is 1 ohm -1 cm -1 . If 
we express e in practical units e^ = 1.60 X 10 — 19 coulomb, then the con- 
ductivity in practical units will be 

<r L = ne Lf i L (18) 

corresponding to 

n = e L /ezji L = 1/1.6 X 10~ 19 X 10 3 = 6 X 10 15 cm" 3 . (19) 

It may be worth mentioning that a conductivity of 1 ohm -1 cm -1 is so 
high that, in the body of the semiconductor, displacement currents (due to 
changing dielectric displacement) may be neglected compared to conduc- 
tion currents (due to the motion of holes and electrons) at least up to 
microwave frequencies. (This is not true in the high resistance layers of 
rectifying junctions, however.) In absolute e.s.u., the displacement 
current is 

(1/4t)««£ = k/E/2 (20) 

where w = 2ir/ is the angular frequency and k the dielectric constant; for 
germanium k is 19. (A better value is 16; see p. 224.) The conductive 
current is 

uE = 9 X 10 n <rz,£. (21) 

1 A more complete list of formulae are given in equations (8) to (17) of Section 8.8. 

2 Some rules for changing units are given in an appendix. 
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Hence the ratio 

displacement current = ^^ ^ ? x ^^ ^ hQ5 x ^-ly. (22) 
conduction current 
Hence the displacement current is negligible up to / = 10 10 sec" 1 corre- 
sponding to a 3-cm wave length in vacuum. 

[The same result can be obtained more compactly in M.K.S. units for 
which, as discussed in an appendix, the conductivity c n is obtained from 
irias follows: 

<r L ohm -1 cm -1 X (100 cm/1 meter) = 100<7 L ohm -1 meter -1 

= <tm ohm -1 meter -1 . (23) 

The ratio of displacement current to conduction current is 

(dD/dt)/<TME = Kiooc/au- ( 24 ) 

For <tm = 100 ohm -1 meter -1 , corresponding to 1 ohm -1 cm -1 , and k = 19, 
and e = 8.85 X 10 -12 this leads to equation (22). The value of co for 
which the ratio of currents is unity is equal to the reciprocal of the relaxation 
time for dissipation of charge density, as may be seen by calculating the 
rate of change of charge density M.K.S. units: 

dp/dt = -V • / = -V • cmE = - (<rjtfA*o)V D = - (<tk/kso)p (25) 
which leads to a rate of decrease in In p of (dp/dt)/ > = -W^o in agreement 
with the result stated above.] 

8 5 PARTICLE TREATMENT OF DRIFT VELOCITY, MOBILITY, 

AND CONDUCTIVITY 

In this section we shall derive the same relationships as in Section 8.4 
but on the basis of a wave-packet or classical particle model. We shall see 
that proper care must be taken in averaging over the free times in order to 
avoid error by a factor of 2. 

We shall again deal with electrons and simply indicate how the same 
results may be obtained for holes. Thus we consider n electrons acting as 
mobile particles in the semiconductor. According to assumption (1), each 
electron has a probability dt/r of colliding with an imperfection in the 
lattice (not with another electron; electron-electron collisions can be 
neglected for most purposes in the theory of semiconductors). Each 
collision, in accordance with assumption 2, brings the electron into the 
thermal equilibrium distribution, statistically speaking, so that its direc- 
tion and speed of motion are uncorrected with those before collision. As a 
consequence, its path is an irregular random motion like that shown in 
Figure 8.5(a). Each leg of this motion is a "free path", and the average 
of them is the mean free path. There is a relationship between the mean 
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free path and the mean free time. In fact, if the average thermal velocity 
of motion is vg, we have 

mean free path = ver. (1 ) 

We shall return to a discussion of mean free paths in Chapter 11 ; in this 
chapter, as already stated, we shall emphasize the mean free time. 




, (INTERVALS BETWEEN 

COLLISIONS SHOWN 
EQUAL FOR SIMPLICITY) 



(8) RANDOM MOTION OF AN ELECTRON 
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(b) COMPONENT OF ELECTRON MOTION 
ADDED BY ELECTRIC FIELD 




(C) COMBINATION OF O) AND (D): ELECTRON 

MOTION IN AN ELECTRIC FIELD 

p I0> 8_5 Effect of Electric Field in Superimposing Drift on Random Motion. 

Under the influence of an electric field, the free paths become curved ; 
in fact, they are segments of parabolas, like the trajectory of a falling body. 
During a path of duration t u the net displacement due to the acceleration 
(a = -eE/m) produced by the field is \at\ 2 , as for a falling body. 
Thus after C collisions, the path would be deformed from (a) to (c)~in 
Figure 8.5 by a displacement of \at x 2 for each path or (C/2)ati 2 for all 
the paths if the free time for each path were h. However, the free times 
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are distributed according to the statistics given in equation (4) of Sec- 
tion 8.3 so that the free time varies. The total displacement is thus 

(a/2)(/ 1 2 + f 2 2 + --- + /c 2 ). (2) 

According to equation (4), a number C exp (— t/r)dt/T of free times have 
durations between t and / + dt, and these contribute an amount 

t 2 Ce~ th dt/r (3) 

to the sum in (2). Adding up for all the possible free times gives 



h 2 + h 2 + 



+ tc 2 =£' t 2 Ce 



■tfr 



dt/r 



= Cr 



' / (//t) 

«/0 



2 e- ((/T) i(//T) 



Or 2 J' x 2 eT x dx 
2Ct 2 . 



(4) 



Thus the average value of t 2 for each collision is 2t 2 and t 2 is not simply the 
square of I which would be t 2 . The fact that the average of the square of 
a quantity is greater than the square of its average is a generally valid result. 
For example, on a 60-cycle line the average square of voltage is (HO) 2 , but 
the average voltage is zero and so is the square of the average. (Further 
examples are discussed in Section 14.3.) Hence the displacement due to 
the C collisions is 

(fl/2)2Ch a = Car 2 . (5) 

These C collisions will require, on the average, a time Ct, since this is the 
definition (1) of Section 8.3 of the mean free time. Consequently, the 
average velocity due to the field is 

v d = aCr 2 /Cr = ct (6) 

where Vd is the average velocity with which the particles "drift" down the 
field. 

If the acceleration is due to an electric field E, its value is a = —eE/m and 

Vd = - (er/m)E (7) 

which is equivalent to expression (8a) obtained in Section 8.4 for v. 

The same expression for the current may be obtained for the particle 
picture as may be seen with the aid of Figure 8.6. Here we consider the 
number of electrons flowing across area A in time /. All the electrons in 
the volume ^|v<i|/ will cross the area carrying a charge of ( — e) from right 
to left or +<? from left to right. There will be some flow of electrons in and 
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out of the volume owing to their random motions ; this flow will cancel 
out on the average, however, since the random motions alone produce no 
net current. Thus if the density of electrons is n per unit volume, the net 
charge transported in the direction of the field will be en\v d \At giving rise to 
a current density which is parallel to E and has the value (current density is 
charge per unit time per unit area) 

/ = {env d At)/At = env d = (ne 2 T/m)E (8) 

again identical with the result of Section 8.4. Similar results will obviously 
be obtained for holes. 




VOLUME OF 
ELECTRONS FLOWING 
ACROSS A IN TIME dt 



charge = enAv d t 

CURRENT DENSITY = erW d 



Fio. 8-6 — Relationship Between Drift Velocity and Current. 

(We shall here, parenthetically, discuss a difficulty which can arise in 
applying the mean free time concept carelessly to the computation of drift 
velocity. We argue, erroneously, as follows: on the average, a particle 
has a free time of T. Suppose we observe it at a random time ; on the 
average we find it midway between collisions and, consequently, at a time 
t/2 since its last collision. Thus a random observation will find it with 
velocity ar/2 and not ot. 

(The argument is wrong. If we observe the particle at a random time it 
will, on the average, have been free for a time t and will remain free for an 
additional time r, so that it will have, on the average, a drift velocity at, as 
derived in equation (6) above. But the last sentence, which is correct, 
appears to imply that the mean free time is 2t. The factor of 2 comes 
from a difference in averaging procedures: in (1) of Section 8.3, which 
gives the definition of the mean free time, each mean free time is given 



204 INTRODUCTION TO CONDUCTIVITY AND HALL EFFECT 



[8.6 



equal weight. However, the selection of a sample of mean free times at 
random instants, automatically weights the choice in favor of the long 
mean free times, and, as a result, the average so obtained comes out 2t. 
This weighting process is mathematically closely related to computing t 2 , 
which also leads to "2" by turning out to be 2t 2 .) 

8.6 DISCUSSION OF HALL EFFECT IN PARTICLE LANGUAGE 

The Hall effect has played a decisive role in revealing the mechanism of 
conduction in semiconductors. The reason is that when data from meas- 
urements of both the Hall effect and the conductivity of an impurity semi- 
conductor are combined, it is possible to determine both the concentration 
and mobility of the carriers; whereas, a measurement of the conductivity 
alone give* only a product nn„ or pn v . 

The Hall effect, discovered in 1879, occurs when a transverse magnetic 
field is applied to a conductor carrying current. Under these conditions 
it is found that in addition to the longitudinal electric field normally 
present a transverse field is produced so that the current (which continues 
to flow longitudinally) and the electric field are no longer parallel. The 
effect thus produces a measurable transverse voltage across the specimen.. 
The cause of the effect is that the magnetic field exerts a force on the 
electrons and deflects them sidewise. Before the development of the band 
theory of solids, it was a mystery why the electrons which carried the cur- 
rent were sometimes deflected so as to produce the effects expected of 
positive charges. However, as we shall show, this is just the effect to be 
expected for holes. We shall give first an introductory pictorial descrip- 
tion of the Hall effect in terms of particles. For this purpose, we neglect 
the random velocities of the particles (which are here regarded as classical 
non-wave mechanical particles) by supposing that after each collision they 
are brought to rest so that all of the motion they acquire is produced by the 
applied fields. (It can be shown that the random velocities cancel out on 
the average here, as they do for the case of conductivity just discussed in 
Section 8.5.) Under these assumptions, the motions of positive and nega- 
tive particles in combined electric and magnetic fields will be as shown in 
Figure 8.7. A positive particle starting at the origin will be accelerated 
by the electric field. As it gathers speed, it is subjected to a sidewise 
thrust by the magnetic field according to the vector formula 

F = {e/c)v X H. (1) 

This deflects it along the curved path until it suffers a collision and starts 
over. As a consequence, the current and electric field deviate by an angle 
6 P , as shown. An electron is deflected to the same side: it acquires a 
velocity opposite to a hole but since the thrust of the magnetic field is 
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proportional to velocity times charge (which is also reversed compared to a 
hole), the sidewise thrust is in the same direction. The current due to 
electrons deviates in the opposite direction _ at an angle of 0„ from the 
electric field. The angles of deviation can readily be obtained on the basis 
of this model, and are given by the equations 

6 P = eTpH/mc = UpH/c, e n = er n H/mc = p n H/c (2) 

when absolute e.s.u. and e.m.u. are employed. We shall omit the deriva- 
tion here since we shall obtain the same formula more simply in the next 
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VECTOR PRODUCT, VxH: FOR HOLES/ FOR ELECTRONS- 

p IG 8_7— Electron and Hole Current in Relation to Electric and Magnetic Fields. 

section. This formula is familiar in connection with magnetron theory, 
eH/mc being the angular velocity of an electron in a magnetic field. 

As will be described later, the angle 9 (or its equivalent), together with 
the direction of the deviation between electric field and current, can be 
found in a Hall effect experiment. This leads to a determination of the 
sign of the carriers and an evaluation of n from (2), which can then be 
combined with the conductivity data to find the concentration of carriers. 

We shall next show that the Brillouin zone theory also leads to a devia- 
tion between current and electric field and to the same formula for 0. 
After that the Hall coefficient will be defined, the experimental procedure 
for measuring it will be described and the results re-expressed in practical 
units. 
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8.7 BRILLOUIN ZONE TREATMENT; HALL, DRIFT, AND 
MICROSCOPIC MOBELITIES 

We next consider the effect of applying electric and magnetic fields simul- 
taneously. This is shown in Figure 8.8. We consider in (a) and (b) the 
case where only a few electrons are present in the Brillouin zone, cor- 
responding to an »-type semiconductor. Under equilibrium conditions, 
with no field present, these electrons will occupy states distributed near the 
center of the zone. As discussed in connection with Figure 8.4, we have 
indicated the electrons by shading and have not attempted to show the 
gradual dropping off of the concentration from the lowest energy values at 
the center of the zone to higher energy values, but have instead drawn the 
distribution as though all energies up to a certain state were equally 
occupied and all those beyond that were unoccupied. For the purposes of 
the qualitative exposition in this paragraph and the next, no error is intro- 
duced by this procedure. (For the analytic purposes, employed later, we 
do not refer to the figure.) In part (a) of this figure we show the effect of 
an electric field alone. The electric field is supposed to act in the plus 
^-direction so that the force on the electron acts in the minus direction. 
As previously discussed, this results in a shift of electron distribution so 
that the steady state value will be as shown in the shaded area. If all the 
quantum states in the shaded area were filled and those outside were 
empty, this shift would effectively add the electrons corresponding to 
region A and subtract those corresponding to region B. Since the dis- 
tribution of electrons is diffuse, however, the net result is merely to increase 
the number of electrons in quantum states in the Brillouin zone near region 
A and to decrease the number near region B. Now, as the figure indicates, 
the velocities near A have a component to the left and those near B, to the 
right, so that the net result of these changes is to unbalance the velocity 
distribution and produce an average momentum P to the left for the 
electrons, as is shown. Since the electron has a negative charge, this is 
equivalent to a current to the right in the direction of the applied field. 

When a magnetic field is also applied, all of the quantum states move in 
accordance with Figure 7.3(b), as previously discussed. This motion 
shifts the group of points in A counterclockwise on the diagram as indicated, 
and the distribution will be distorted into the form shown in Figure 8.8(b) ; 
the average velocity vector will, thus, make an angle with the field, as is 
shown on the figure. From this figure, it is easy to understand why the 
angle between the current and the electric field is given by the magnetron 
frequency times the mean free time. If an electron suffered no collisions, 
then in the presence of crossed electric and magnetic fields its velocity 
would run through a complete cycle of directions in the period correspond- 
ing to the magnetron frequency. This means that if the electric field were 
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suddenly removed and collisions were prevented, the shaded circle would 
rotate about the center of the diagram with the magnetron frequency, as 
has been described in connection with Figure 7.S. This process is termi- 
nated in a mean life time t so that, as we shall show analytically below, the 
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F IG . 8-8— Effect of Electric Field, E x , and Magnetic Field, H„ on Electron and Kfcle 
Distributions in the Brillouin Zone. 

effective angle through which the distribution is rotated is T times the 
magnetron frequency. 

The formula for the angle between current and electric field can readily 
be derived analytically from assumptions (1) and (2) of Section 8.3. 
According to these, if the total current at any instant is /, then it will 
decrease in time dt due to collisions by an amount Idt/j, since during dt a 
random or representative fraction dt/r of the electrons is restored to the 
thermal equilibrium state. At the same time, each electron is changing 
its current because of the accelerations produced by the applied fields. 
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We shall calculate the change in current due to these fields by summing the 
changes for each electron. (The procedure is essentially the same as that 
used in deriving equation (4) of Section 8.4.) If we multiply equation (16) 
of Section 7.6, which expresses / in terms of /, E, and H, by dt so as to find 
dl, and suppose that this dl corresponds to electron "1" of the n electrons 
we are dealing with, we obtain 

dh = {f?lmV)Edt - hX H(e/mc)dt. (1) 

Summing over the « electrons and taking V as unit volume, we obtain 

dl = dl x + dh + h dl n = {ne 2 /m)Edt -IX H{e/mc)dt (2) 

for the change in / due to the acceleration of the fields during time dt. 
(The reader should note that the effect of random motions is not apparent in 
this equation, since only the total current / occurs in it.) The change due 
to collisions is 



dl = -Idt/r. 
For the steady state the sum of these changes must be zero, giving 
(n<?/m)Edt -IX H(e/mc)dt - Idt/r = 



(3) 



or 



/ = {ne 2 r/m)E - / X H(er/mc). 



(4) 



In order to simplify the algebra involved in interpreting this vector equa- 
tion, we shall take the ,*-axis as parallel to / (rather than to E) and shall 
compute the direction and magnitude of E. For these conditions the 
vector product I X His thus parallel to_y with magnitude —I x H Zi H being 
parallel to the 2-axis as before. Thus the x and y components of equation 
(4)_of Section 8.7 are: 

I x = (ne Z T/m)E x = oE x 



= <rE„ 



(—I x Hz)(er/mc) 



(5a) 
(5b) 



or 



E x = I x /(n<?T/m) = I x la (6a) 1 

E v = -{I x /<r){nHJmc). (6b) 

This shows that the£ vector is turned clockwise in the x-y plane in respect 

1 Since no approximations have been involved in dealing with the consequences of equation 
(16) of Section 7.6 and assumptions (1) and (2) in deriving (6a), we conclude that these 
assumptions lead to the result that the power dissipation I X E X due to current I x is independent 
of//. This shows that magnetoresistance (an increase in resistance in magnetic fields) cannot 
be derived from these assumptions. Magnetoresistance, theory shows, is connected with 
variations in mean free time of the sort excluded by assumption (1) or with variations of the 
energy surfaces from spherical form. See Chapter 12. 
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to the / vector. For small angles this will give 

6 = Ey/Es = erHJmc = nH./e. (7) 

This is the same result which was discussed in connection with the particle 
treatment and shown in Figure 8.7. 

The only effect of using holes instead of electrons is, as pointed out at 
the end of Section 7.6, to change the coefficient of the H, term and thus 
reverse the rotation. Of course, the values of t and m appropriate to the 
carriers in question should be used. The diagrams showing the displace- 
ment of electrons in the Brillouin zone, however, have quite a different 
appearance and are shown in Figure 8.8(c) and (d). As explained for 
Figures 7.3 and 7.5, the rotation produced near the top of the energy band 
is opposite to that near the bottom, thus accounting for the opposite rota- 
tions for electrons and holes of (b) and (d) of Figure 8.8. 

In this treatment we have stressed the Hall angle 6 rather than the Hall 
constant R H . The reason is that the Hall constant, which we shall discuss 
in the next section, measures the number of carriers, whereas 0, or rather 
0/H, measures the mobility; and the latter quantity is much more funda- 
mental in semiconductors, in which the number of carriers is a highly 
structure-sensitive property, depending as it does on the impurity content. 
Furthermore, d/H is the more readily measured quantity. (It will have 
the same value even if the y% cross-section of Figure 8.9 discussed in the 
next section is irregular or unknown in shape.) In order to describe the 
mobility as measured by the Hall effect, we introduce the term Hall 
mobility and the symbol 

Q 

liji — c — - = Hall mobility. (8) 

H 

In general, the Hall mobility will not be equal to the mobility required in 
the expression for conductivity. It has been possible only recently to 
measure the mobility in terms of drift velocity by using the techniques 
described in Section 3.1. We shall introduce the term drift mobility defined 
as the drift velocity of injected carriers divided by electric field. 



average drift velocity , .. .... 

MD = f r-^r-, L = drift mobility. 

electric held 



(9) 



The drift mobility as measured by the transmission velocity of a hole pulse 
in a germanium filament may actually be different in concept from the drift 
velocity discussed in earlier sections of this chapter; if the semiconductor 
contains levels capable of binding holes tightly enough so that they are free 
to move for only a fraction of the time, then the drift mobility will be only 
a fraction of the microscopic mobility, which we define as the mobility of an 
untrapped particle. The drift of color centers in alkali halide crystals is an 
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example of this sort, and the drift mobilities are very low, being of the order 
of 10" -4 cm/sec per volt/cm. 2 Evidently such trapping effects will reduce 
the drift mobility but will not affect the microscopic mobility or the Hall 
mobility. 3 Trapping by donors and acceptors in germanium at room 




O) VOLTAGE COMPONENTS FOR CONDUCTION 
BY ELECTRONS IN A MAGNETIC FIELD 




(b) VOLTAGE COMPONENTS FOR CONDUCTION 
BY HOLES IN A MAGNETIC FIELD 

Fig. 8-9— Transverse Field Due to Hall Effect. 

temperature will be negligible, as is discussed in Chapter 10; in silicon at 

2 See N. F. Mott and R. W. Gurney, Electronic Processes in Ionic Crystals, Oxford Uni- 
versity Press, 1940, p. 141, for a review of the work of R. W. Pohl and his colleagues on this 
subject. See also F. Seitz, "Color Centers in Alkali Halide Crystals", Rev. Mod. Phys., 18, 
384-408 (1946). 

3 The earliest experiments of which the author is aware in which a comparison of drift and 
Hall mobilities were made for the same particles were carried out during some unpublished 
work with photoelectrons in silver chloride by J. R. Haynes, using techniques described by 
J. R. Haynes and W. Shockley, "Report of a Conference on Strength of Solids", The Physical 
Society, London, 1948, p. 151. No evidence for trapping which reduced the mobility was 
found at room temperature. 
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room temperature and in germanium at lower temperatures, however, 
trapping should produce distinguishable differences between the drift and 
Hall mobilities. In Chapter 12, a review of mobility data is presented. 

It may be worth while to point out the relationship of drift and micro- 
scopic mobility to conductivity. Suppose we have an K-type semicon- 
ductor with a certain density of excess electrons which are continuously 
and frequently becoming trapped and then free again. Under these 
conditions the conductivity will correspond to the entrapped density so 
that we may write 
a = e X (microscopic mobility) X (density of untrapped excess electrons) 

= ettD X (total density of excess electrons). 

Thus either of the mobilities may be used equally well provided that it is 
properly paired with the appropriate density. This sort of a determination 
of n from conductivity and total donor density is discussed in Section 12.9. 
In the next section we shall describe the standard technique for carrying 
out a Hall effect experiment and shall convert the formulae from the abso- 
lute electromagnetic system employed here to other units. For this pur- 
pose, we shall not distinguish between the different definitions of mobility. 
In the following section we shall discuss the difference between micro- 
scopic mobility, denoted simply as n, and Hall mobility and quote results, 
derived later in Chapter 11, relating the two. A further discussion of this 
difference is given in Section 12.9. 

8.8 MEASUREMENT OF THE HALL EFFECT- 
PRACTICAL AND M.K.S. UNITS 

The Hall effect is measured by using samples similar to those illustrated 
in Figure 8.9; in these the current flows parallel to the long dimension, 
which is supposed to be large compared to the transverse dimensions. 
When a magnetic field is applied to such a sample, it deflects the current 
carriers to one side and causes the current to deviate from the direction of 
the electric field. During the initial transient phase, while the magnetic 
field is building up, a transverse current actually flows in the specimen and 
carries charge across it from one side to the other. This charge accumu- 
lates on the two opposite faces, making one positive and the other negative, 
and sets up a transverse electric field. After steady-state conditions have 
been established, this trarisverse field just balances the deflecting effect of 
the magnetic field and no further charge accumulation on the surfaces takes 
place. The current then flows parallel to the long direction (#-axis) in 
Figure 8.9. The electric field is not parallel to the x-axis and has a jy-com- 

1 For a discussion of the Hall effect in short samples see I. Isenberg, B. R. Russell, and 
R. F. Greene, Rev. Set. Inst. 19, 685-688 (1948). 
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ponent. In accordance with the results of the last section, the angle 
between the current vector / and the electric field vector E is given by 
equation (7) of Section 8.7. Part (a) of Figure 8.9 shows relative orienta- 
tions ofE, I, H and the specimen for the case of electrons and part (b) for the 
case of holes. Because is a small angle, 2 we may replace its cosine by 
unity and its sine and tangent by 6. (For brevity we shall omit the subscripts 
y and zjrom I y and H t .) As discussed previously, 6 = erH/mc = y.H/c. 
Also because 6 is small, we may write I = cE x for the current density. 
Accordingly, the expression for transverse field may be simplified to 

E y = BE X = ei/tr = (±n/ca)HI (1) 

where the plus sign holds for holes and the minus sign for electrons. 

The Hall coefficient Rh is by definition the coefficient of HI in the 
foregoing equation. It gives the ratio of the transverse electric field to the 
longitudinal current density times the magnetic field. It is given by the 
equation 

R H = 0/cH = ±n/cc = \/n(±e)c (2) 

and is thus seen to give a direct measure of the number of carriers present 
and also their sign. (If both holes and electrons are present the more 
complicated expression, equations (10) of Section 8.9, is obtained.) 

Usually the Hall coefficient is measured in terms of the transverse voltage 
produced by the transverse field between two directly opposite points on 
the sample. Also the total current through the slab, denoted by It, 
rather than the current density /, is used. These quantities are related to 
those in the previous equation as follows: 

I T = IWT, V 2 -V l = E y W (3) 

where W and T are the transverse dimensions of the sample. Combining 
these with the previous equation we obtain 

Vi-Vi- RhHIt/T or R B = (V 2 - V{)T/HI T . (4) 

These equations are expressed in the absolute c.g.s. system of units so that 
currents and voltages are to be measured in electrostatic units. Except 
for this, the form given in the last equation is that used to measure Rh in 
practical units. As defined in connection with the directions shown in 
Figure 8.9, the sign of the Hall effect determines whether or not the con- 
duction is by electrons or holes. We shall discuss the case where both holes 
and electrons are present in Section 8.9. 
Practical Units. We must next convert the Hall Effect formulae to the 

2 At least so far as first-order effects alone are concerned. 
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units ordinarily used in the laboratory: 

cm, gram, sec/or mechanical quantities, 
volts, coulombs for electrical quantities, 
gauss or oersted for magnetic field. 

The subscript L, for laboratory, will be used for quantities measured in 
these units. Summarizing the previous results, which are expressed in 
c.g.s. units and absolute e.s.u. and e.m.u., we have 

R a = B/vH = l/n(±e)c (5) 

c\R b \<f = » = erjm (6) 

t = mc\Rn\<r/e. 0) 

The Hall coefficient in practical units, denoted by R L , is expressed in 
cm 3 /coulomb, that is, the reciprocal of a charge density expressed in 
coulombs per cubic centimeter. This can be obtained from equation (5) 
by eliminating the speed of light, c, and multiplying by a factor to convert 
absolute electrostatic units of charge to coulombs. Denoting by e L the 
charge on the electron in coulombs we have e = ffi/10 in accordance with 
the customary definition of the ratio of these units. This leads to the 
following relationship between practical and electrostatic Hall coefficients: 



Rl = 



n(±e L ) n(±e)lO 10 n(±e)c 



= 9X W 19 R H 



(8) 



where we have used the customary approximation that c = 3 • 10 cm/sec. 
In order to determine R L from practical measurements, we shall first see 
how R H is obtained when currents and voltages are measured in practical 
units. Denoting the total current through the specimen by I L we then 



{V» 



and 



V x ) = (F 2 - rOi/300, It = 3 X 10 9 //, 
{V 2 - V{)T 



R 



HI T 



{F 2 - V X ) L T 
9 X \0 n I L H 



(9) 



(10) 



so that we get 

R L = \rf(V 2 - Ft) L T/HI L . (ID 

Equation (11) defines the value of R L in cm 3 /coulomb in terms of measured 
quantities. Using the relationship E yL = iVz - V\)hiW and E xL = 
IlIoiWT, one may rearrange equation (10) to obtain the angle between 
electric field and current 



= E v l,/E x l — 



{V 2 - Fi) L Ta L 



= \QT*RlclH. 



(12) 
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Since the conductivity in practical units is equal to the charge density of 
carriers in practical units times the mobility in practical units, we have an 
easy way of obtaining the mobility in practical units as follows: 

ol = newh = \ilI\Ri\ ohm -1 cm -1 ; ei, = 1.60 X 10 -19 coulombs (13) 

V-l = <tl\Rl\ cm 2 /volt sec. (14) 

This result for hl, which leads to the useful formula 

9 = \0-*»lH, (15) 

can be converted to an expression for the mean free time by using p = er/m. 
Equation (14) must then be multiplied by 300 to convert it to mobility per 
electrostatic volt, and we thus obtain 

t = (300m/e)ui L = 5.7 X 10- lf Vz, sec = 5.7 X \<T u R L <x L sec. (16) 

At room temperature, the thermal velocity of an electron is 10 7 cm/sec 
(provided its effective mass is that of a free electron). Hence the average 
distance it travels between collisions is 

/ = 10 7 t = 0.57 X 10r*R L v L cm = (0.57 w ) A (17) 

where A = 1 angstrom = 10 -8 cm. This gives a simple relationship 
between mobility and mean free path applicable at room temperature. 
(Mean free paths for silicon are discussed further in Chapter 11.) 
M.K.S. Units. In M.K.S. units the basic force equation becomes 

F M = e M [E M + v M X B m ] (18) 

where the units are 

Em — volts/meter (19) 

B M = webers/meter 2 (20) 

e M = 1.60 X 10~ 19 coulombs. (21) 

A field of one weber/cm 2 is equal to 10 4 gauss. In terms of these units 
the basic equations become 

om = »msmhm ohm -1 meter -1 (22) 

«m = carriers/meters 3 • (23) 

liM = meters 2 /volt sec (24) 

Rm = I/hm^m = meters 3 /coulomb (25) 

B radians = umBm. (26) 

That the ratio of transverse electric field to longitudinal current density 
times magnetic flux density gives the M.K.S. Hall constant may be verified 
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as follows: 

E y M __ ExM0 _ MM _ 1 ,r)-)\ 

Im^M &xMOMt>M CM »MffM 

Equation (26) shows that mobility has the dimensions of I/Bm = meters / 
weber or meters 2 /volt sec because of the definition of the weber. The ex- 
pressions for t and for / corresponding to (16) and (17) become 

t = 5.7 X VT 12 m sec (28) 

/ = 5.7 X 10~" 7 hm meters = 5700/w angstroms. (29) 

8.9 MODIFICATIONS OF TEE HALL EFFECT FORMULA 

As stated earlier, assumptions (1) and (2) of Section 8.3 are only an 
approximation to the actual situation, in which the probability of a transi- 
tion depends in a complicated way on both the initial and the end state. 
When the variation of this probability is taken into account, it is found that 
a different average mean free time, call it t h , occurs in the Hall angle 

6 = er H H/mc (1) 

and in the mobility (call this average t m ) : 

M — eTy./in. (2) 

For electrons in semiconductors at temperatures so high that thermal 
scattering predominates, the times and mobilities are related by the 
formula, derived as (28c) in Section 11.4, 

3ir 3x i ,„. 

Tff = Y T " and /*# = y • w 

As a consequence, the Hall coefficient, as given by equation (5) of Sec- 
tion 8.8 in terms of measured values 6, <r, and H, is 

_ ±erjiH m _ Tjy 1 _ 3jr ^ 1 _ ,^ 
Rb = ^H = mc nfrpH' r/n(±e)c~ 8 ' n{±e)c' 

This is the formula usually used for Hall effect in semiconductors. For 
simple metals and for very impure semiconductors, which have the electron 
statistics of metals, th turns out to be the same as t,, and the formula is 
R H = 1/nec. These results are derived in Chapter 11. 

When both electrons and holes are" present, the Hall constant is a com- 
plicated average of the Hall constant and conductivity for each type of 
carrier. In Figure 8.10 we show the construction necessary to evaluate 
the Hall constant when both holes and electrons are present. The mag- 
netic field comes out of the paper, parallel to z as for Figure 8.7. Under 
these conditions the hole current I v and electron current 7„ deviate in 
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opposite directions from the electric field. The net current / must, of 
course, have no transverse component across the specimen since such a 
component would quickly set up charges and produce a transverse field 
which would reduce the transverse current to zero. Although the net 
transverse current is zero, the transverse electron and hole currents in- 
dividually will not vanish and there will be a tendency for both holes and 
electrons to be swept to one side of the filament as discussed in Section 3.2. 
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Fig. 8-10— Hall Effect for Both Holes and Electrons. (Magnetic Field in +2 
Direction, toward the Reader.) 

For electric fields so small that no conductivity modulation is produced by 
this effect, the densities of holes and electrons will have their equilibrium 
values and the currents will be as shown in Figure 8.10. Our problem is 
thus to evaluate d from the construction shown in the figure and to use this 
value to determine the Hall constant in accordance with Rh = 0/o7/, 
equation (2) of Section 8.8. By convention, we shall regard an angle like 
0„ as negative. When both types of carriers are present, we have from 
equation (13) of Section 8.4 

c = e(«/u„ + pn p ). (5) 

Using the approximation employed for the Hall effect that cos 6„ — cos 6 P 
= 1, we obtain a current parallel to E of 

I x = In + I P = {ney^ + pep. v )E x . (6) 

The ^-component of current is obtained by the approximation 
sin B n = n and sin 9 P = 6 P and by recalling that the Hall effect conven- 
tion gives Iy/I x = —0 for Figure 8.10. Thus we obtain 

Iy= - (J„0„ + Ijfip). (7) 

The values for 6 n and d p are defined in equation (2) of Section 8.6 as 
$ n = -UnH/c, 6 P = +Hj,H/c, (8) 
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From these equations we find that the Hall angle for the total current is 
. -Iy - (nen n E x nnH/c) + {pen v E x n v H / c) 



h 



(nen n + pen p )E x 
(-mn 2 + Pn P 2 )(H/c) 



n^n + PUp 



(9) 



Substituting this in the formula for R H gives (see also Prob. 11 p. 219) 

{-n^+p^){H/c) 



R H = = 

<rH (nun + pii p ) (ettn„ + epn P )H 

-ny. n 2 + 



-nb 2 +p 

ec{nnn + pl*p) 2 {nb + pfec 



(10) 



where (using Haynes' drift mobility data for Ge, see Section 12.9) 

b = m«/mp(=2.1 for Ge and =3.0 for Si) (11) 

is the ratio of electron to hole mobility, n is the density of electrons (that is, 
number/cm 3 ), p the density of holes, e the charge on the electron, and c 
the speed of light. The corresponding values of Rh in practical and 
M.K.S. units are 



#£ = 



— nb 2 + p 
{nb + pfe h 



R\ 



— nub 2 + Pm 
(n M b + pu) 2 e L 



(12) 



where eL = 1.6 X 10 -19 coulombs. The measured value of Rl is defined 
in equation (1 1 ) of Section 8.8. It may be readily verified that if conduction 
is by electrons only (that is, p = 0) or holes only (that is, n = 0) this 
formula reduces to equation (8) of Section 8.8 or Rl = 1/»(±^l). Mak- 
ing allowance for the difference in th and t„, modifies (12) to the form 



Rl = -~t 



nb 2 



8 {nb + pYe L 



(13) 



This is the form frequently quoted in the literature. For an example of 
the use of this formula in interpreting conductivity data, the reader is 
referred to the paper by Pearson and Bardeen. 1 

One simple consequence of (12) should be mentioned in closing. A 
/>-type sample, which has n<^.p and, consequently, Rl > 0, will, upon 
heating, become intrinsic with n c^ p. Since b is > 1, this will cause a 
reversal in sign of the Hall coefficient at the temperature for which « = p/b 2 . 
This point of reversal is used as discussed in Problem 7 in determining b. 1 

1 G. L. Pearson and J. Bardeen, Phyi. Rev. 75, 865-883 (1949). 
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Problems 

1. From the distribution given by equation (4) of Section 8.3 compute 
the average values of /, / 3 , / 3 , f 4 . 

2. Obtain the general formula t n = » ! t" by considering the integral 
1(a) and its derivatives: 

d"I(a) 



7 <-> -r 



e- tti dt= 1/a; t n = (-l)"a- 



i«" 



3. In the long interval T of equation (1) of Section 8.3 find the total 
time occupied by paths with free times between / and t + dt. This time 
divided by T is the probability that an observation at a random instant 
will select a mean free time with duration between / and / + dt. Prove 
that the average of the mean free times selected in this way is 2t. (Com- 
pare with the discussion at the end of Section 8.5.) 

4. Show that if the free times all had exactly the same length / (this 
would be a violation of assumption (1)), then the drift velocity would be 
vd = —eEt'/2m for electrons. 

5. The lowest energy bands in a solid are very narrow since there is 
hardly any overlap of the wave functions; on the other hand, the high 
bands are relatively wide. Supposing that the mean free time is about the 
same for the valence-bond band and the conduction band, which would you 
expect to have the higher mobility, an electron or a hole ? 

6. Suppose a magnetic field is instantaneously applied to a semicon- 
ductor which is subjected to a constant E. There will be a short time lag 
before the current reaches its new steady state value. What process will 
determine this time ? 

7. The ratio of mobilities b is sometimes obtained from data on ap-type 
specimen by using the temperature To for which Rjj=0, so that nb 2 —p = §. 
For somewhat lower temperatures, all of the acceptors are ionized and p has 
a "saturation" value p, equal to the excess of acceptors over donors and n is 
negligible. As the intrinsic range is approached, holes and electrons are 
produced in equal numbers by the breaking of valence bonds so that 
p = p s -f- n. If the saturation conductivity line is extrapolated to To, a 
conductivity <r e is obtained, whereas the actual conductivity is <r . Show 
that 



£0 



b- 1 



or b = 



ffo 



oo 



<Te 



8. Verify the practical unit formulae of Section 8.8 by deriving them 
from the M.K.S. expressions. 

9. This problem is intended to illustrate some fundamental features of 
magnetoresistance effects. Suppose for this purpose that the electrons in 
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a. crystal can be separated into two classes, with densities n and n', relaxa- 
tion times t and t' and both with the same mass. Show from (6a) and 
(6b) of Section 8.7 that the Hall angles are 



e 

6' 



tan 1 (erH z /mc) 



tan * (er'Hu/mc) = tan" * wt' 



— i 



tan wt 



Carry out a calculation like that shown for Figure 8.10 and show that 
although there is no net transverse currents there are equal and opposite 
transverse currents due to the individual classes. Show that if n't' = »t, 
the resistance, defined as power dissipation per unit volume for unit current 
density, varies as 

4 + 2(<ot) 2 + 2(<ot') 2 
4 + («t + «t') 2 

Show that if t = t' this ratio is unity for all values of H z . However, if 
t' = 10t it varies from unity to 202/121 = 1.67 as H z varies from to .«. 

10. Analyze the side thrust on an w-type filament carrying current 
in a magnetic field. Consider the fact that the net force per unit volume 
due to the transverse field E y upon the electrons must be equal and opposite 
to that on the donors, which are mechanically attached to the filament. 
The electrons also transfer momentum to the filament by collisions. Show 
that the condition t m = Th corresponds to the case in which collisions trans- 
fer no net transverse momentum to the filament. The problem of visualiz- 
ing the details of momentum transfer for the electrons of an almost full 
band so as to establish the mechanism for the force on holes is more difficult 
but can be done by methods like those of Section 7.6. This problem also 
shows that the procedure sometimes employed to obtain the Hall angle 
formula by equating eE y to ev x Hi/c or to evJ&Mx is not fundamental since 
it neglects momentum transfer by collisions. 

11. Show that if bo is defined as the ratio of drift mobilities (Section 8.7) 
and bs as the ratio of Hall mobilities: 

bD = V-Dn/f-Dp, bu = PHn/PHp 

then the Hall coefficient [equation (10) Section 8.9] becomes 
Rs = (nHp/p.D P )(p - nb D b H )/(p + nb D ) 2 ec. 
Show that in Problem 7, the result becomes 

cj(r e = b D {\ + b s )/(b D bH — 1). 



CHAPTER 9 

DISTRIBUTIONS OF QUANTUM STATES 
IN ENERGY 

9.1 ENERGY LEVEL DIAGRAMS FOR PURE CRYSTALS 

For the purpose of dealing with the equilibrium distribution at various 
temperatures of the electrons among the quantum states of the crystal, it is 
necessary to consider the distribution in energy of the quantum states. 
This quantity, denoted by N(&), is denned in the following way: Consider 
a crystal of unit volume. Suppose that a complete list of the quantum states 
and their energies were prepared for all the allowed quantum states in all the 
Brillouin zones of the crystal. From this list, select and count all the 
quantum states whose energies lie in some particular interval 6 to & + d&> 
of energy; call this number dS. Then N(&) is defined by 

N(&) = dS/d& or dS = N{&)d&. (1) 

Thus we have 

Definition of N(&): N(&) is the number of quantum states per unit 
energy per unit volume of the crystal. 

In accordance with the theorem of the conservation of quantum states 
discussed in Sections 5.3 and 5.5, the number of quantum states in the 
Brillouin zone is simply proportional to the size of the crystal while the 
width of the energy bands is independent of the size of the crystal (that is, 
the energy band width theorem, Section 5.3). Thus, the number of 
quantum states per unit energy range for a crystal of volume V is simply 

VN{&). 

If the energy £ falls in a forbidden band of energies, that is in a gap 
between the allowed energy bands (see Sections 5.3 and 5.6), the value of 
2V(5) is zero. If S falls within an energy band, then the value of iV(6) 
may be obtained in principle as follows: Corresponding to the energy band 
there will be one or more Brillouin zones. In each of these the values 6 
and & + d& define two closely neighboring energy surfaces. In any 
particular zone there will thus be a certain volume dV P of P-space between 
the two surfaces. The volume per allowed point in this space is h s /V, 
where V is the volume of the crystal (see Sections 5.2, 5.5, and 14.8, and 
Figure 5.10), and each such point gives rise to two quantum states, one 
with each spin. Hence, the number of quantum states in the range d& 
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per unit volume of crystal will be 

2dV P /{h*/V) 

do — — 



{Vh 3 )dV P 



(2) 



If several zones have overlapping energies, then the total number of states 
in & to S + d& is obtained by adding the contributions of the individual 
zones. In connection with the theory of conductivity, it is found advan- 
tageous to re-express dV P in terms of the element of area of the energy 
surfaces, which has the dimensions of P 2 and is denoted by dQ,, and the 
energy difference d& between two neighboring surfaces. The formula is 

dV P = dSda/v (3) 

where v = \v\ = |Vj>§(P)| is the speed associated with P at the point of 
interest. The derivation of (3) will be found in Section 11.2 in connection 
with equation (5), in which the formula is used; it is repeated here for 
ease of reference. 

As an example, we consider the number of quantum states per unit 
energy near the bottom of an energy band. As discussed in Sections 7.5 
and 7.6, we may use the approximation that near the top or bottom of an 
energy band, the energy varies as P 2 or P' 2 where P = \P\ and P' = |P'|. 
For simplicity, we suppose the lowest energy is at the center of the zone so 
that according to equation (11) of Section 7.5 

g = S + P 2 /2m. (4) 

From this we obtain 

d& = (P/m)dP. (5) 

The volume of the spherical shell between P and P + dP in P-space is 
4irP 2 dP and it, therefore, contains 

dS = (2/A 3 )4jtPVP (6) 

quantum states. This can be expressed in terms of the energy by solving 
(4) for P and (5) for PdP obtaining 

P = [2»»(& - S )] H (7a) 

and 

PdP = md&. (7b) 

These may be substituted in equation (6) to give 

dS - («tt/A 3 )P 2 ^P = (4x/A 3 )(2^) ?4 (S - 6o)^« = N(&)dS (8) 

leading to the desired value for N(&): 

AT(S) = (4*-A 3 ) (2»0^(6 - S ) H . (9) 

This equation gives the number of quantum states per unit volume of the 
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crystal per unit energy range for energies slightly above & , the minimum 
energy of the Brillouin zone. The total number of states having energies 
less than a certain maximum value & m is obtained by integrating N(&)d& 
from &o t0 ^»» : 



«/E0 



on 



-5 )^ = 4.5X10 21 (^-So)^ (10) 



the numerical value giving number per cm % for energy in electron volts. 
This formula is frequently used in the theory of metals in which it is 
assumed that the energy band has the form (9) and that all the states 
below a certain energy are occupied. Equation (10) can then be used to 
calculate the energy of the highest occupied state. 

Entirely similar results will be obtained for the distribution of states 
near the top of a band. In this case, equation (4) of Section 7.6 or 
g = g _ p 2 /2m is used, and equation (9) is modified by replacing 
(S — S ) H by (S — S) 54 . It should be mentioned that these results are 
dependent on the same assumptions about the simplicity of the energy 
surfaces as those employed in Section 7.5. 

In Figure 9.1 the behavior predicted from equation (9) and certain 
general considerations have been combined to show the qualitative features 
to be expected for the distribution of quantum states in energy for diamond. 
We consider a unit volume of crystal containing N a atoms. Three energy 
bands are shown. The lowest corresponds to the Is atomic level and is 
occupied by the two core electrons per atom, shown for the carbon atom in 
Figure 1.2. Since, as discussed in Section 5.5, the diamond structure has 
two atoms per unit cell, two Brillouin zones are required to accommodate 
the four core electrons per unit cell. The eight valence electrons per unit 
cell similarly require four Brillouin zones which overlap to give rise to the 
valence-bond energy band. If we assume that equation (9) holds near 
the edges of the bands [a form similar to equation (9) should always hold, 
provided m is replaced by a suitable effective mass], then the distribution 
should appear as in Figure 9.1 (a). Energy is plotted upward in this figure 
so as to correspond to the energy band picture in (b). The number of 
states in the valence-bond band is 



S = f^ N($)d&, 



(11) 



an element of the integration being shown by the shaded area. The core 
electron band is narrow in energy and thus has a high value of N(&). 

The distribution of quantum states in the conduction band has been 
shown as continuing indefinitely to higher energies. This is in keeping 
with the idea that the higher atomic levels, which produce the conduction 
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Fig. 9-1 — Distribution in Energy of the Quantum States. 

energy bands, interact so strongly that the resulting energy bands cover all 
possible higher energies, leaving no gaps. 

Figure 9.1 is appropriate, so far as the valence-bond band and bottom 
of the conduction band are concerned, to pure silicon and germanium. 
However, it must be modified when impurities are present. 



9.2 ENERGY LEVELS DUE TO IMPURITIES- 
ACCEPTOR LEVELS 



-DONOR AND 



We shall consider first the case of a donor impurity, for example, a 
phosphorous atom inserted in silicon. As discussed in Chapter 1, an atom 
like arsenic or phosphorus has one too many plus charges for its share of 
the four valence bonds. Therefore, it constitutes a positive charge in the 
lattice. It also brings with it an extra electron which neutralizes this 
charge but cannot be a part of the valence-bond structure. This extra 
electron uses wave functions in the conduction band. As we have seen, an 
electron near the bottom of this band behaves in the crystal in much the 
same way as an electron behaves in free space, except that the electron in 
the crystal may have an effective mass different from that of an electron in 
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empty space. In accordance with this picture, we might expect it to behave 
in the presence of the impurity much as a free electron would behave in the 
presence of a positive charge. Detailed considerations of the wave mechan- 
ical nature of this problem shows that this picture is essentially correct. 1 
The electron moving in the conduction band has modes of motion around 
the impurity atom essentially like those which an electron in free space has 
around a proton in a hydrogen atom. There are two important differences: 
In the case of the hydrogen atom, the electron is attracted by the charge of 
one proton. In the case of the phosphorous impurity, the electron is 
attracted by the same charge. However, this charge is embedded in a 
dielectric medium. Because of the polarizability and dielectric constant 
of the semiconductor, the attraction of the donor impurity for the electron 
is reduced from e 2 /r 2 to e 2 /nr 2 where k is the dielectric constant, e the 
electronic charge, and r the distance of separation. The other essential 
difference is that the effective mass is not that of a free electron. This 
effect is probably somewhat less important. As a consequence, the wave 
functions for the electron about the donor impurity are like hydrogen wave 
functions but for an atom with a much smaller effective nuclear charge. 
The well-known formula for the binding energy of a single electron to a 
nucleus is given by 

2ir 2 m*e i Z 2 /h 2 = 1 3. 6Z 2 electron volts. (1) 

In this formula e is the charge on the electron, m* its effective mass which 
we take as the free electron mass for purposes of calculation, h is Planck's 
constant, and Z is the nuclear charge. For the semiconductor case, Z is 
equal to 1/k. Since k is 11.9 for silicon 2 and 16.1 for germanium, 2 the 
binding energy is 0.10 electron volt for silicon and 0.05 electron volt for 
germanium. The radius of the wave function or electron orbit is also 
increased by a factor k and thus is so large that it overlaps many atoms as 
shown in Figure 1.14. 

The formula for the hydrogen-like wave function is 

1^ h 2 

e- r "°S «o = ^ (2) 

J me 

where r is the distance from the ion and a is the radius of the first Bohr 
orbit and has the value 0.529 X 10~ 8 cm = 0.529 A (angstroms). 

This bound state is removed from the other states in the conduction band 

1 G. Wannier, Phys. Rev. 52, 191-197 (1937); S. R. Tibbs, Trans. Faraday Soc. 35, 1471- 
1484 (1939); S. Peckar, /. Phys. U. S. S. R. 10, 431-433 (1946); H. M. James, Phys. Rev. 76, 
1602-1610, 1611-1624 (1949)"; J. C. Slater, Phys. Rev. 76, 1592-1601 (1949); of which the 
last is most pertinent. 

2 H. B. Briggs, Phys. Rev. 77, 287 (19S0). See also J. F. Mullaney, Phys. Rev. 66, 326-339 
(1944), H. B. Briggs and W. H. Brattain, Phys. Rev. 75, 1705-1710 (1949), M. Becker and 
H. Y. Fan, Phys. Rev. 76, 1530-1531 (1949). 
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by about the interval of 0.10 electron volt for silicon. As a result, each 
donor impurity removes a state from the conduction band and establishes 
it as a quantum state of lower energy. For most practical cases, the total 
amount of impurity present is very small compared to the total number of 
atoms, so only a very small fraction of the quantum states in the conduc- 
tion band is reduced to lower energies. The hydrogen atom has, in addi- 
tion to its lowest state, a series of higher energy levels, and these will be 
present around the donor impurity also. However, these levels will lie 
very close to the conduction band, and electrons will be so readily excited 
from them into the conduction band that it is not necessary to consider 
these higher levels further for the applications with which we are concerned. 

The donor impurity atoms will have a similar effect upon states in the 
valence-bond band, and one state at the bottom of the valence-bond band 
will be moved to a lower level for each donor atom present. States at the 
bottom of this band are, however, without interest because, at all tempera- 
tures met with in semiconductors, these states are fully occupied. 

Acceptor impurities, however, do have an important effect upon the 
states in the valence-bond band. These impurities represent a negative 
charge since their atomic cores are insufficiently charged to neutralize the 
four electrons in their share of the valence bonds. Consequently, they can 
attract a positive charge. As we have seen, a hole in the valence-bond 
band has many of the attributes of a positive charge. Quantum-mechani- 
cal theory shows that a hole in the valence-bond band, in addition to 
behaving as a positive charge for Hall effect and conductivity, will have 
modes of motion around an acceptor impurity of a type entirely analogous 
to the motion of an electron around a donor impurity. As a consequence 
of this, some of the states in the valence-bond band are raised to higher 
levels by the presence of acceptor impurities. 

In Figure 9.2 we show the effect of acceptor and donor impurities on the 
quantum-state distribution. We consider unit volume of material so that 
the numbers of quantum states are, in effect, densities (number per unit 
volume). The valence-bond band is represented as having 4N„ quantum 
states, where, as before, N s is the number of atoms per unit volume of the 
crystal. There are Nd donor atoms present and these depress Nd states 
from the valence-bond band and from the conduction band to lower levels 
as described above. Owing to the two values for the spin, these levels 
actually are 2A^ in number. However, we must take into account the 
fact that, if an electron is in a quantum state around one of the donor atoms, 
it neutralizes this atom so that another electron cannot be trapped there 
with the same binding energy. This means in effect that only Nd rather 
than 2Nd of the states are available. (This feature of the effect of spin 
upon the statistics of electrons in quantum states is described only approxi- 
mately by the foregoing method. However, for purposes of this expo- 
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sition it is not necessary to use a more exact treatment. See Problem 2, 
Chapter 16.) Similarly, the N a acceptor atoms remove N a states from the 
valence-bond band, leaving a total of 4N, - N d - N a quantum states in 
the band. As is shown in the figure, Nd states are pushed out of the 
valence-bond band to the lower level, as previously discussed. The nota- 
tion for the energy levels uses the subscripts corresponding to conduction, 
donor, acceptor and valence-bond. 
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Fig. 9-2 — Energy Level Diagrams. 

(a) Quantum state distribution including effects of donors and acceptors for an 
w-type semiconductor. 

(b) Energy level diagram showing locations and charges of impurities for a sample 
with balanced impurities. 

(c) As for ib) for an «-type sample at low temperature. 

Diagrams like Figure 9.2(b) and (c) are frequently employed to describe 
and analyze situations in which the distribution in space as well as energy is 
important. In these diagrams the © and symbols represent energy 
levels at donors and acceptors and also the net charge. The charge is 
defined for the situation in which the valence-bond structure is complete 
and no excess electrons are left over. For this case each donor represents 
an excess positive charge. The donor symbol is located at the energy with 
which the donor can bind an electron and is thus somewhat below the edge 
of the conduction band. Electrons in the conduction band and trapped in 
donors are represented as in Figure 9.2(c). A hole, due to an electron 
missing from the valence-bond band, would similarly be represented by a 
plus sign, and, since holes tend towards higher levels in diagrams like 
Figure 9.2(c), a hole trapped to an acceptor would be represented by a plus 
sign next to the acceptor symbol. It is evident from these definitions that 
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electrical neutrality for diagrams like Figure 9.2(c) requires the vanishing 
of the sum of the charges shown. Diagrams of this sort are used in 
Chapters 4 and 12. 

For high densities of impurities, new effects set in, which are associated 
with pronounced overlapping of the wave functions of the impurity atoms. 
This situation is closely analogous to those discussed in Section 5.3 in con- 



Table 9.1 Energies in Electron Volts 







Silicon 


Germanium 


S c ~ B| 


Experiment 


1.12- 3X 10- 4 T a 


0.72 s 


&c— Sd 


Theory 


0.08" - 0.054" 


0.05" 


S — S» 


Theory 


0.08" 


0.05" 


S c — S<J 


Experiment 


0.06* 


0.013' 


So — S„ 


Experiment 


0.075" 


0.013' 


So ~ S„ 


Bombarded Centers 


. . . 


0.046 ±0.01' 
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° G. L. Pearson and J. Bardeen, Phys. Rev. 75, 865-883 (1949). This value was deduced 
from an extensive analysis of the temperature dependence of pn. See Section 16.3 for a dis- 



cussion. 
b 



Unpublished estimate by J. Bardeen. 
'Using m = free electron mass, k = 13 for Si, k = 16.1 for Ge in eq. (1). 

Reference (a), based on an electron mass m n = 0.67m. 
e Reference (a). See also Figure 9.3 and text. 

'Preliminary analysis of early data by G. L. Pearson and W. Shockley, Phys. Rev. 71, 
142 (1947). 

' Preliminary results by G. L. Pearson. 



nection with bringing atoms together to form a solid. As was shown in 
Figure 5.4, when the atoms approach close enough, appreciable splitting of 
the atomic energy levels occurs, and it becomes appropriate to deal with 
the electrons as being in wave functions corresponding to states in the 
Brillouin zone. A similar effect will occur with the wave functions around 
the impurities. As the impurity density becomes higher, the wave func- 
tions overlap more and finally form a band of energies. Electrons in these 
states will then conduct in the same way as do electrons in a metal. In 
fact, the situation closely corresponds to that of an alkali metal which may 
be thought of as an array of small positive ions neutralizing a space charge 
of electrons moving in free space. In the semiconductor there is a random 
array of donor (or acceptor) ions neutralizing a space charge of electrons 
(or holes) moving, not through free space, but in the manner appropriate to 
their states in the Brillouin zone. Evidence for this type of behavior is 
found in both n- and />-type silicon and germanium. Samples of high 
impurity content have been observed to have substantially metallic 
behavior with numbers of carriers which are independent of temperature at 
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temperatures below the intrinsic range. Examples of this behavior are 
shown in Figures 1.10 and 1.11 and are discussed further in Chapter 11. 
On the basis of analyses of Hall effect and conductivity as a function 
of temperature, values have been determined for a number of the energy 
parameters discussed in this section. These are quoted in Table 9.1 
together with explanatory remarks and references. The data for germa- 
nium is of a preliminary character. 




IMPURITY CENTERS /CM 

p IG _ 9-3— Dependence of Ionization Energy of Impurity Centers upon Density. 

Pearson and Bardeen find that for silicon the ionization potentials 
S c - § d and & a — & v vary with impurity concentration in the way shown 
in Figure 9.3. They interpret the decrease with increasing concentration 
as being due to the electrostatic attraction of other donors for an electron 
which has escaped from its own donor. This attraction reduces the average 
energy of an electron which has escaped from its own donor so that the 
ionization potential for its escape is reduced. They find that the data for 
holes in Figure 9.3 can be fitted by the formula 

g o _ S B = 0.08 - 4.3 X \CT*N a H electron volt (3) 

from which the solid curve is drawn. The value of the coefficient is of the 
right order of magnitude as may be seen by calculating the electrostatic 
potential due to one ion at a radius equal to half the average distance 
between ions. This radius will be approximately %N a H = r. The poten- 
tial at that radius is 



— electrostatic volts 
or 



nnn 4.8 X 10" 10 , 

300 — volts 

11. 9r 



= 1.2 X lO-V" 1 = 2.4 X VT s N a H volts. (4) 
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In their calculation a more detailed procedure is used which takes into 
account the average potential of the electron at points nearer the other ion 
and a value of 3 X 10~ 8 iV o H is obtained. 

It has been pointed out by Seitz 3 that there are objections to the pro- 
cedure used by Pearson and Bardeen due to the fact that the wave functions 
implied in their analysis are not orthogonal. Seitz's investigations in- 
dicate that the interpretation of Figure 9.3 may involve quite subtle effects. 

Problems 

1. Obtain formula (10) of Section 9.1 by considering a sphere in mo- 
mentum space with radius P m given by S m = P m 2 /2m. 

2. Calculate the first and second ionization potentials for substitutional 
impurities differing in valence by two from silicon and germanium taking 
24.5 electron volts as the first ionization potential of helium. Compute 
the radii of the wave functions and compare with the lattice constant for 
the case of one bound electron. 

' Personal communication. 



CHAPTER 10 

FERMI-DIRAC STATISTICS FOR SEMICONDUCTORS 

10.1 THE FERMI-DIRAC DISTRIBUTION FUNCTION 

In this section, the effects of thermal agitation in distributing the elec- 
trons among the various quantum states will be described. We shall 
consider first the state of affairs at the absolute zero of temperature and 
then see how this is modified by the excitation of electrons from quantum 
states of lower energy to those of higher energy. The population of 
quantum states at various energies can be described in terms of the Fermi- 
Dirac distribution function, which is discussed in the first part of this 
section. This distribution function is generally applicable to electrons in 
the sense that it is independent of the detailed nature of the distribution of 
quantum states in energy. In the latter parts of the section, the Fermi- 
Dirac distribution function is applied first to a hypothetical example with 
only 36 quantum states and 20 electrons in order to illustrate the general 
features in their simplest form. This example is followed by an application 
of the methods to the distribution of quantum states in energy appropriate 
to an impurity semiconductor. 

At the absolute zero of temperature, the electrons fall to the lowest 
energy states available to them and fill these up subject to the limitations 
imposed by the Pauli exclusion principle. As a result all the quantum 
states below a certain energy are filled and those above that energy are 
empty. If thermal agitation is present, however, electrons will tend to be 
excited to higher states. On the basis of statistical mechanical theory, this 
tendency is expressed in terms of energy by the quantity kT which repre- 
sents the average thermal energy of a one-dimensional oscillator at tem- 
perature T. Accordingly, electrons which at absolute zero have energies 
only about kTless than empty states may be excited to these higher states. 
However, electrons having energies many times kT less than empty states 
have an almost negligible probability of being excited. Thus for the 
equilibrium distribution, the sharp division between filled and empty 
states becomes smeared out into a transition region about 2kT wide at 
higher temperatures. 

This situation is described in detail by the Fermi-Dirac distribution 
function. This function, denoted by the letter f, depends upon 6> and T 
in a way shortly to be discussed. The values of f range from zero to unity 
and are interpreted as follows: 
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a. 



The value of the Fermi-Dirac function f is the probability that under thermal 
equilibrium a quantum state of energy & is occupied by an electron. It is, 
therefore, the average value of the fraction of all the quantum states of energy 
$ occupied by electrons. 

The significance of the properties just described will become more 
apparent in terms of the examples 
discussed below. 1 

The function f is plotted in 
Figure 10.1 and is seen to vary 
f om zero for high energies to 
unity for low energies. It makes 
the transition from values nearly 
equal to zero to values nearly 
equal to unity over a range of 
energy equal to about 4&T. The 
energy &f, for which f takes on 
the value \, is called the Fermi 
level. 

For purposes of comparison with | 
energies at room temperature, a 
scale in electron volts is also shown. 
At room temperature, T = 300° K, 
kT corresponds to an energy of 
0.026 electron volt. It is sometimes 
also convenient to compare these 
energies with the energy of light 
quanta. In Figure 10.2, we com- 
pare the energy scales in terms of 
degrees Kelvin, electron volts, wave lengths in angstroms, and frequency 
v of the light quanta. 

The statistical nature of the Fermi-Dirac distribution function should be 
pointed out. What f prescribes is the probability that a quantum state of 
given energy be occupied. If, for example, the system being considered 
has six quantum states at the energy for which f = J , then under equi- 
librium conditions there will be on the average three electrons in these 
states. However, this number will fluctuate as electrons make transitions 
in and out of the states as a result of thermal agitation, and in fact for 
6 } t of the time the quantum states will be filled with six electrons; for ^j- 
of the time they will contain five electrons ; for -H, four ; for ff , three ; 
for if, two; for -fa, one and for ^ of the time all the states will be empty. 
For this case, with only a small number of states involved, the statistical 

1 An outline of the statistical mechanical theory of the Fermi-Dirac distribution function 
is presented in Chapter 16. 
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fluctuations are large. In a semiconductor, however, we are generally 
concerned with groups containing 10 10 or more states. For such large 
numbers as these, the statistical fluctuations are negligible on a per- 
centage basis for the purposes of this chapter. 

The mathematical form of the Fermi-Dirac distribution function gives 
complete symmetry to holes and electrons as will be shown below by 
consideration of its mathematical form. If, as is indicated on Figure 10.1, 
the energy level for which f = J is denoted by &f> called the Fermi level, 
then the value of f at energy & is given by the formula 2 

f 3 f[( g _ 5p)/kT] = * F .. - (1) 

1 + exp [(& — & F )/kT] 

As explained previously, the value of f gives the probability that a 
quantum state at energy S is occupied. If S is larger than 6>f by several 
times kT, then the exponential in the denominator is large compared to 
unity and f takes the approximate form 

f a* exp [- (S - & F )/kT] for £ > S F . (2) 

This approximation gives the same distribution of electrons as do the 
"classical" statistical mechanical distributions of Maxwell and Boltzmann. 
We shall refer to it and to the corresponding approximation for holes, dis- 
cussed below, as the classical approximation. 

On the other hand, if S is less than &f, the exponential in the denominator 
is small and the approximation (1 + s) — 1 = 1 — s may be used giving 

f 5^ 1 - exp [(5 - S p )/kT\ for £ < & F . (3) 

Thus for energies much above &f, i approaches zero as exp ( — A$/kT) 
where A£ is the height above £/?. Below &f, f approaches unity, the dif- 
ference between f and unity being given by the expression exp ( — A£/£T) 
where AS is now the energy Mow £/?. 

Now, if f is the fraction of quantum states occupied by electrons, then 
obviously 1 — f is the fraction left vacant or, in other words, the fraction 
occupied by holes. The approximation (3) thus shows that the probability 
of finding that a quantum state is occupied by a hole at AS below &f is 
just equal to the probability of finding a quantum state occupied by an 
electron at AS above &f- This result is valid in general and not merely 
for approximations (2) and (3). If we write f p = 1 — f for the distribu- 
tion function for holes, we find by straightforward algebraic manipulation 

f p - f p [(S - 6,)/*n = 1 - f = — * (4) 

1 + exp[(fe F - Q>)/kT] 

* As (1) shows, f is a function of three variables G>, Q>f and T. In cases where T and &p do 
not vary, we shall sometimes write f as simply f or f(e>). 
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which shows on comparison with (1) that f p is the same function of energy 
measured downward from 6>f as f is for energies measured upwards. In 
other words, the function f of Figure 10.1 has a center of symmetry at 
& = &p and f = J. 

The functions f and i p are each functions of the single variable 
(5 — $p)/kT. In many cases, however, 6>p and T are constant and the 
only variable of interest is € or P upon which & depends. In such cases we 
may for brevity write the functions in one of the following forms: 

f = f(S)= f[S (/>)]= f(i>) (5a) 

f p = f,(S) = f p [6(P)] = 1 P (P), (5b) 

the latter forms being specially adapted for use with quantum states in the 
Brillouin zone. This notation is not mathematically consistent but should 
cause no confusion. 

One of the principal problems in determining the equilibrium distribution 
of the electrons in the quantum states arises in connection with the location 
of the Fermi level Sj? on the energy scale. As will be discussed below, the 
location of &f is determined by: (1) the distribution of quantum states for 
the system under consideration ; (2) the total number of electrons in the 
system; and (3) the absolute temperature. The temperature is an im- 
portant variable in determining the location of the Fermi level for semi- 
conductors, the situation here being quite different from that in metals for 
which the location of the Fermi level is substantially the same at all 
temperatures. Once the Fermi level has been determined for a given 
temperature, however, the distribution of electrons in energy can be 
obtained directly by combining the values of N(S), the number of quantum 
states per unit energy range, with f[(S — Sp)/kT], the fraction of these 
that are occupied. Detailed examples of these procedures are given below. 

Under conditions in which S>f is substantially independent of tem- 
perature, the temperature dependence of f for values of S which differ 
from &p by several times kT can be approximated by equations (2) or 
(3). For each of those the number of electrons or holes varies simply as 
exp (— L&/kT) as discussed previously. This type of behavior leads to a 
straight line plot for In f vs \/T as mentioned in Chapter 1, with AS giving 
the slope or activation energy. However, in order to determine what 
AS means in terms of the energy-level diagram, it is necessary to know the 
location of 6>p. This subject is investigated below. 

10.2 FERMI-DIRAC STATISTICS FOR A SIMPLE MODEL 

Rather than discuss initially the case of a semiconductor whose quantum 
state distribution is represented by Figure 9.2, we consider a small system 
with only 36 states and 20 electrons. The advantage of dealing with a 
small system is that illustrative distributions of electrons can be shown in 
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detail rather than in analytical form. For the small numbers involved, 
statistical fluctuations are important and should be considered; we dis- 
regard them here, however, since the general features to be brought out are 
applicable for large systems where fluctuations are unimportant. The 
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quantum state distribution, Figure 10.3, has been drawn to represent 
approximately two energy bands, so that it has some similarity to the 
distribution in a semiconductor. 

In Figure 10.3(a) we represent the situation at the absolute zero of tem- 
perature. At this temperature the electrons will attempt to occupy the 
lowest quantum states; however, because of the Pauli principle that not 
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more than one electron may occupy a given state, they will fill up states to 
relatively high energies, as is shown. At the absolute zero of temperature, 
the transition of the Fermi function from zero to unity will be infinitely 
sharp, as a result of which all of the states above the level &f are empty 
and all of those below are filled. Since the system considered contains 20 
electrons as shown, the Fermi level must lie between the energy levels $ a 
and S> b . If it were drawn below & a , the system would contain a maximum 
of 18 electrons, and if it were drawn above &b, it would contain a minimum 
of 24 electrons. 

For higher temperatures, however, it is found that the Fermi level must 
lie below 6> a - This can be understood by considering (b), for which kT is 
comparable to the spacing between levels in the upper group. For this 
case, the Fermi level is drawn as though it were precisely at the energy 6> a . 
Under this condition, on the average, half of the states at 6> a will be occu- 
pied. Since electrons are excited well above the Fermi level for the case 
shown, there is an appreciable probability of finding electrons at energies 
Sj, and &c, and the expected numbers are approximately one at each of these 
higher energy values. Of course, while the situation represented in (b) is 
only one of a large number of possibilities which are simultaneously repre- 
sented by the Fermi function, it has been drawn so as to represent correctly 
the average behavior to be expected so far as numbers of electrons at each 
energy are concerned. Accordingly, we will find an average of three 
electrons in the upper group. On the other hand, the lower group of quan- 
tum states lie so far below the Fermi level that the probability of finding a 
hole in them is negligible for the situation represented in (b). Conse- 
quently, we see the total number of electrons for this case is 21 or one higher 
than the required number. In order to produce the desired number of 
20 electrons, it is necessary to shift the Fermi level downwards as is shown 
in (c). This downward shift of the Fermi level with respect to the dis- 
tribution of quantum states decreases the probability that any state is 
occupied by an electron, as may be easily seen from the shape of the Fermi- 
Dirac distribution curve. As a consequence, the shift downwards of (c) 
compared to (b) decreases the probability that the states in the upper 
group are occupied; and if the correct shift is made, the probability will 
be such that on the average only two electrons will be found in the upper 
group and thus the required number, 20, will be achieved. 

As mentioned before, the shift of Fermi level with temperature is a 
characteristic feature in the behavior of semiconductors. It is quite 
different from the situation in metals where the Fermi level at higher 
temperatures comes at very nearly the same level as it does at absolute zero. 

At still higher temperatures, the situation for our simple example will be 
as shown in (d). In this case kT'is comparable to the spacing between the 
two groups of quantum states. Under these conditions, some electrons 
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will be excited from the lower group to the higher group, and in order to 
keep the total number of electrons equal to 20, the Fermi level has been 
moved still farther down and nearer the center position between the two 
groups. 

We must next consider the extension of these results in analytical form 
to the quantum state distribution of a semiconductor. 

10.3 FERMI-DIRAC STATISTICS FOR A SEMICONDUCTOR 

An essential condition to be met in the interior of a semiconductor is that 
of electrical neutrality. This condition follows from the well-known 
theorem in the theory of conductivity that the net charge density within a 
conductor at equilibrium must be zero. As applied to a semiconductor this 
requirement is equivalent to that used in Section 10.2, namely, that there 
must be a fixed total number of electrons in the quantum states. We shall 
first consider the factors which determine this number and next see how 
the Fermi level must be adjusted to meet the requirement. 

Since we are interested in relatively small deviations from the perfect 
valence-bond structure, it is convenient to deal not with the total number of 
electrons, but instead only with the net excess over the number required 
exactly to fill the valence-bond structure. This net excess number of 
electrons per unit volume is made up of four contributions: 

n = number of excess electrons in the conduction band (that is, 

not bound to donors) per unit volume of crystal; 
rid = number of excess electrons in bound states about donors per 

unit volume of crystal ; 
p = number of holes in the valence-bond band (that is, not 

bound to acceptors) per unit volume of crystal; 
p a = number of holes in bound states about acceptors per unit 
volume of crystal. 

If all these quantities are zero, the valence bonds will be complete and 
there will be no excess electrons. If they are not zero, then the net excess 
number of electrons is clearly 

n + n d -p-p a . (1) 

In addition to the four densities just defined, a number of others will be 
required in discussing the statistics of this chapter. For convenience we 
tabulate them below, although N c and N v will not be precisely defined until 
later. For brevity we use "density" to replace "number per unit volume". 

If the valence-bond structure were perfect and there were no excess 
electrons, the crystal would not, in general, be neutral because of the 
localized charges of the donors and acceptors. As discussed in Chapters 1 
and 9, when the valence-bond structure is perfect around a donor atom, 



238 FERMI-DIRAC STATISTICS FOR SEMICONDUCTORS [10.3 

Table 10.1 



Positive Charges 



Negative Charges 



Other Densities 



p = density of holes in valence-bond band 
p a = density of holes bound to acceptors 
Pt = P + Pa 
Nd = density of donors 

n = density of excess electrons in conduction band 
na = density of excess electrons bound to donors 
«( = » + rid 
Na = density of acceptors 



N, = number of atomic sites per unit volume 
N c = N c (T) = effective density of states in conduction band 
N v = N p (T) = effective density of states in valence-bond band 



the latter represents a localized positive charge and an acceptor represents 
similarly a net negative charge. For donor and acceptor densities of N d 
and N a this means that there would be a net charge density in the semi- 
conductor of e{Nd — N a ) if the valence-bond structure were complete. 
Since (1) represents the excess electrons over a complete valence-bond 
structure, the net charge density, which must Vanish, is given by 

e(N d - No) ~ e{n + n d - p - p a ) = 0. (2) 

This can be rewritten as 

» + n a - p - p a = N d - N a (3) 

which states that the net excess of electrons is just equal to the excess of 
donors over acceptors (a situation discussed in connection with Figure 1.9). 
The values of w, n a , p and p a are determined by (1) the distribution of 
quantum states in energy, (2) the value of &p, and (3) the temperature. 
The appropriate equations are obtained as follows: In the energy range 
d& there are dS = N(tb)d& quantum states. A fraction f [(§ — &p)/kT\ 
of these are occupied, leading to 

f[(S - & F )/kT]dS = f[(S - $ F )/kT]N(S)dS> (4) 

electrons in states in the range d&. If this quantity is integrated over a 
range of energies, such as &i to & 2 shown in Figure 10.4, which covers the 
donor states and the conduction band, it will give n d plus n. Thus we shall 
have 

n t = n + n d = [^[(S - & F )/kT\N{&)d&. (5) 

The exact value of S 2 is unimportant since f approaches zero rapidly at 
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high energies. In fact, f is negligible except near the very bottom of the 
energy band. This is illustrated in part (b) of the figure in which N(&)i 
and N(&)f p are plotted, and it is seen that the important parts of the 
integrals come near the edges of the bands. This subject will be dealt with 
in more detail later. A similar calculation gives the number of holes: 



Pt = P +Pa = f\[(6 - &r)/kT\N{&)d& 
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where S3 to S4 includes the full band and the acceptor levels and f p = 1 — f 
(given by [equation (4) of Section 10.1]) is the probability that a quantum 
state of energy & is vacant (that is, occupied by a hole). 

For a given temperature T, both rtt and pt of equations (5) and (6) are 
functions only of €/?. Hence the equation 



n t 



pt = N d - N a 



(7) 



may be regarded as containing only one unknown, namely &f, since Nd, N a 
and the distribution N(&) may be regarded as specified by the nature of the 
semiconductor considered, the situation being similar to that discussed in 
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connection with simple model containing 20 electrons of Figure 10.3. In 
Chapter 16, some detailed examples of solving for 6>p are given. In this 
section we shall indicate only how n t and p t vary in general and then 
describe the final statistical results. 

In the integral for n h the integration extends over the Nd donor levels. 
These are all supposed to lie at one energy 5>d for which the f has the value 
f[(&d — &F)/kT]; hence, the integral gives for the number of electrons in 
the donor levels the value 

n d = N d f[(& d - $F)/kT\ (8) 

as is indicated on Figure 10.4. 

The number of electrons in the states in the conduction band is obtained 
by an integration over N(S) near the bottom of the conduction band. 
The situation is simplified by the fact that, for most cases, &f lies well 
below the bottom of the conduction band so that approximation (2) of 
Section 10.1 may be used. This enables us to write 

n = e &FlkT p N(&)e- & ' kT dS. (9) 

t/Sc 

Since the exponential decreases by e~ 1 for each increment kT of S, we may 
limit, as a rough approximation, the integration to the interior of the 
energy surface in the Brillouin zone for which & — & c = kTorP 2 /2m = kT. 
Since kT is very small compared to the width of the energy band, the 
approximation [equation (11) of Section 7.5] & = 5> c + P 2 /2m may be 
used so that the energy surface is a sphere containing a volume 4irP s /3 
where P 2 /2m = S - & c = kT. This sphere contains (2/A 3 )(4ttP 3 /3) = 
SirF^/Sh 3 quantum states per unit volume of the crystal. Since in this 
volume exp ( — &/kT) is approximately exp ( — & c /kT), we may approxi- 
mate the integral by exp (-& e /kT) times 87rP 3 /3A 3 . Writing P={2mkT) y \ 

n £* e -<Sr_s ' ) /fcr (87r/3) {2mkT/h 2 f\ (10) 

The exact integration carried out in Chapter 16 gives 

n = e-^ SF)lkT 2(2TrmkT/h 2 ) H = N^ 6 ^^'", (11) 

a result about one-third larger. Comparing this with expression (8) above 
for rid and using approximation (2) of Section 10.1 for f, we see that the 
conduction band behaves like a group of states localized at energy 6> e , 
the number in the group being not Nd but instead 

N c = 2(2wmkT/h 2 ) H = 4.82 X W 15 T H cm" 3 . (12) 

[A factor {m n /m)' ii is to be introduced if the effective mass is not m\ N c 
may be referred to as the effective density of states in the conduction band. 
The expression for n is valid only when 6>p is less than &d by several times 
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kT. At room temperature, with m taken as the mass of the free electron, 
N c = 2.41 X 10 19 cm 3 or about 1/2000 the atomic density. TV,, varies 
slowly with T compared to the exponential functions involved in the theory 
and can be regarded, for many purposes, as substantially constant. 
Precisely similar calculations may be made for p a and p giving 

Pa = N a i p [(&a - S F )/kT] = NJ[(& F - 6> a )/kT] (13) 

p = 2(2«rcFf/£ 2 ) M e- (5 '- 6 " )/ir = N*-**-** ' ' kT '. (14) 

N v and N c will differ if the electrons and holes have different effective 
masses. 

The use of N c and N v is limited to cases for which f and fj, can be approxi- 
mated by the simple exponential functions of equations (2) and (3) of 
Section 10.1. If the concentration of carriers is increased sufficiently, these 
approximations are no longer valid and other formulae must be employed. 
We shall illustrate this for the case of electrons. 

(The reader may skip the next three paragraphs and Figure 10.5 on a 
first reading.) 

If the number of electrons in the conduction band is very large, say one 
per atom as in the case of a monovalent metal like sodium or copper, then 
the Fermi level will lie very far above the bottom of the conduction band. 
For the case of sodium, which has been most extensively investigated, there 
is experimental and theoretical evidence that the energy is given by 
P 2 ' /Im up to the highest occupied levels so that the equation (9) of Sec- 
tion 9.1 for N(6>) may be used. Calculation then shows that the levels 
must be occupied up to a maximum energy S m of about 3.16 electron volts 
above the bottom of the conduction band. This energy is so much greater 
than kT that the transition region off is negligible in comparison, and the 
value of $>f may be calculated simply by solving equation (10) of Section 
9.1 for & F : 

N(&)d& = (8Tr/3h 3 )[2m(& F - S e )] M (15a) 

S F - S c = & m = .tf(3n/9r) H /2m = 21.6 X lO~ 16 n H electron volts (15b) 

where G> F = § c + S m , and m is taken as the free electron mass. Cases of 
this sort, for which the temperature no longer has an appreciable effect, are 
said to be degenerate and the distribution of electrons is called a degenerate 
electron gas. 

For any given temperature, the statistics of the electrons in the band 
will vary from classical to degenerate as the number is increased. The 
dividing line is conventionally described by the degeneracy concentration 
»deg or the degeneracy temperature Td eg . These two quantities are related 
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by the equation, 



■* deg 



-ET 



Urn 



n dei 



= 4.2 X \(T ll n Ae . v \ 



(16) 



This equation is interpreted as follows: At absolute zero the electrons are 
condensed into the states of lowest energy and require all the states up tch* 
certain energy S m given by (15). The degeneracy temperature defined l?f 
the equation satisfies the equation kT deg = & m . In other words, if T 'is 
high enough to raise electrons from the bottom of the band to levels higher 
than $ m , the degeneracy will be largely destroyed. Equation (16) serves 
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Fig. 10-5 — Exact Dependence of n Upon (& F — d> c ) and T Compared to 

Limiting Forms. 



equally well to give the degeneracy temperature for a given concentration 
or the concentration for a given temperature. 

When n is comparable to » deg , there is no simple approximate formula 
for n in terms of T and &f — & c , and the value must be obtained by an 
exact integration 1 of N(S){ (&)d&. The result of the exact integration is 
shown in Figure 10.5. On this plot the limiting forms (11) and (15) are 

1 J. McDougall and E. C. Stoner, Roy. Soc, London Trans. 237A, 67-104 (1938); Muller- 
Pouillet, Lehrbuch der Physik 4 (4), pp. 271 ff.; A. Sommerfeld, Zeits.fiir Physik 47, 1-32 
(1928); L. Nordheim, Annalen der Physik 9, 607-678 (1931). 
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also shown. The point corresponding to n = n deR is given by & m = kT 
and thus corresponds to the value of n for §>f — &c = kT on the com- 
pletely degenerate curve, this value of n being about 0.75N C . The exact 
solution for n = 0.75N C corresponds to & F — & c slightly negative so that 
even the states at the bottom of the band are less than half-filled and the 
distribution is not very degenerate. The distribution will be quite de- 
generate, however, for situations in which 6> F — 6> c is 2kT or more. This 
corresponds to values of n about three times larger than the «de g correspond- 
ing to a given T, or conversely to temperatures of 3 - ^ = 0.48 as large as 
the Tdeg corresponding to a given n. 

We shall have occasion in Chapter 11 to discuss degenerate cases. For 
the purposes of illustrating the method of finding the equilibrium distribu- 
tion in a semiconductor, however, it is simpler and equally instructive to 
consider a non-degenerate case. Accordingly, we return to the expressions 
for n, p, n d , and p a which give these quantities as simple functions of 6>p. 
These are substituted in the equation (3) for electrical neutrality, which 
then becomes an equation which may be solved for &f- As mentioned 
previously, graphical procedures for obtaining the solution are shown in 
Chapter 16. The description of the results obtained for a particular 
example is as follows: 

We consider a case for a germanium sample which, like the silicon 
sample A of Figures 1.9 and 1.12, has N d > N a - In particular we have 
chosen N d = 10 15 cm -3 and N a = 10 14 cm -3 . The energy scale has been 
chosen with zero midway between the energy bands and with E c = 0.36, 
E d = 0.32, E a = -0.32, and E v = -0.36 electron volt. The dependence 
of 6>f upon T for this case is shown in Figure 10.6, the calculations being 
described in Chapter 16. Qualitative descriptions of a number of the 
features of Figure 10.6 are given below. 

At very low temperatures we have 

n d = N d - N a (17) 

since there will be no holes and all the extra electrons are in bound states 
on the donors. Hence, a fraction n d /N d of the donor levels will be filled. 
Consequently, Sj? must be within a few times kT of & d ; for otherwise, 
the states would be either all full or all empty in accordance with the 
behavior of f at low temperatures shown in Figure 10.3(a). Since &f 
is near & d > the number of electrons in the conduction band will vary as 
exp [— (& e — &d)/kT] in accordance with equation (11). Thus, for this 
case, the activation energy obtained from a log n vs 1/T plot will be the 
binding energy & c — & d of an electron in a donor state rather than half 
this value. The temperature range over which this approximation is 
valid extends only up to about 10°K as may be seen from the plot of 6>f 
versus T given in Figure 10.6. 

As the temperature is increased still further, an appreciable number of 
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electrons are excited to the conduction band, and &f tends to move down 
so as to keep n t constant, by reducing the number of electrons in the donor 
levels. Over the temperature range for which there is substantially com- 
plete ionization of the donor levels, the equation 

n^N d -N a (18) 




200 300 400 

TEMPERATURE IN DEGREES KELVIN 



600 



Fig. 10-6 — The Fermi Level for Electrical Neutrality Plotted Against Temperature, 
for Germanium with N d - 10 16 cm -3 and N = 10 14 cm -3 and 8 C — & d = 0.04 ev. 



holds. This temperature range is referred to as the saturation range since 
all available excess electrons are in the conduction band. The saturation 
range is entered as soon as 6>p has fallen several times kT below &d so that 
the donor levels are substantially empty. On Figure 10.6, the value of kT 
in electron volts is shown as a function of temperature ; it is seen that at 
about 100°K, S d — & F > IkT so that for temperatures above this the 
donors are fully ionized. In this range, increasing the temperature requires 
a decrease in &f for the same reasons shown in Figure 10.3(b) and (c). 

This decrease finally brings &f to a position a few kT above the midpoint 
between the bands. As soon as this occurs, an appreciable number of holes 
is produced. At still higher temperatures, &f approaches the midposition 
more closely, for otherwise either the holes or the electrons would greatly 
outnumber the other, and the condition, equation (3), of electrical neu- 
trality could not be satisfied. As a consequence of this stabilization of &f, 
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both w and p increase exponentially as exp [— (5 C — Sj?)/£T] and 
exp[-(§F — S„)/£T]; since & F is midway between S c and S v , these 
both reduce to exp [- (S e - & v )/2kT] so that, as mentioned in Chapter 1, 
the activation is half the forbidden energy gap. The intrinsic range has 
been selected for Figure 10.6 as the range for which n/p < e, that is, it is 
the range for which & lies less than \kT above the midpoint of the energy 

gap- . 

In the event that N a is greater than A r d , there is always a net excess of 
holes. Because of the general symmetry between hole densities and 
electron densities, as shown in equations (3) of Section 10.1 and (7) to (14) 
of this section, an entirely similar behavior occurs, with tb F starting at & a 
at low temperatures and rising to (S c + S„)/2 at high temperatures. 

The description in this section thus puts in somewhat more formal terms 
the physical picture described in Chapter 1. In Chapter 16, additional 
details are given showing more fully the trend for & F and the way in which 
the electron and hole densities vary. 

This section may be summed up by saying that, once the distribution of 
quantum states in energy is known and the temperature specified, a definite 
procedure can be stated for finding the equilibrium distribution of electrons 
and holes. This procedure requires the Fermi level, & F , to be adjusted to 
give electrical neutrality [equation (3) of Section 10.3]. Certain simplifica- 
tions can be introduced in the procedure, such as the use of N e and N v , the 
effective numbers of quantum states in the conduction band and valence- 
bond band; the general trend of the behavior can be seen to be that dis- 
cussed in Chapter 1 in connection with Figures 1.9 to 1.12. However, 
there are few simplifying features, and only for the low temperature and 
intrinsic ranges can a simple interpretation be given of the slopes of the 
plots of the logarithm of n versus \/T. 

The fact that there is complete statistical symmetry between holes and 
electrons, as shown by equation (4) of Section 10.1 and its consequences, 
should be noted. This symmetry, together with the results of Chapters 7 
and 8, shows that the only significant feature distinguishing the behavior 
of holes from that of electrons in semiconductors is the sign of the effective 

charge. 2 

An important consequence of the form of the approximations (11) and 
(14) for n and p is that their product is a function of T only and is inde- 
pendent of n and^> individually: 

np = N C N V exp [(S - 5 v )/kT\ = N C N V exp (-6 /*T) 

= 2.33 X 10 31 T 3 exp (-Sg/kT) cm" 6 . (19) 

For intrinsic material, n and p are equal to each other and each is equal to 
2 The mathematical investigation of the equivalence takes major portions of Chapters 15 
and 17. 
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the square root of equation (19). We shall use this constancy in the next 
section and shall discuss it further in Sections 12.4 and 16.4 and in connec- 
tion with its use by Pearson and Bardeen in analyzing silicon alloys. 
Equation (19) can also be derived on the basis of the statistical theory of 
detailed balancing, which requires that the rate of recombination of holes 
and electrons (proportional to np) should equal their rate of generation, 
which should depend on T as does the right side of equation (19) but not 
on n or p. 

10.4 APPLICATION AT ROOM TEMPERATURE 

Because of the practical interest in high resistivity germanium at room 
temperature, we shall present a simplified analysis and describe the results. 
In Section 16.3 the same problem is analyzed by the general methods of 
Section 10.3 making use of illustrative values of 0.72 electron volt for 
& c — £>„ and 0.04 electron volt for & a — tb v and S c — &d- We shall use 
here a somewhat more direct procedure which utilizes different experimental 
data. If the intrinsic line for germanium is extrapolated to room tempera- 
ture it yields a value of 48 ohm-cm for the resistivity. 1 On the other hand 
the resistance of a relatively pure w-type sample may be 2.4 ohm-cm. 
(Using 2600 cm 2 /volt sec for the mobility for electrons in germanium, this 
gives a concentration n = cr L /eLfiL = 1/2.4 X 1.6 X 10 -19 X 2600 = 
1.0 X 10 15 electrons/cm 3 in agreement with the example of Chapter 16.) 
The ratio of mobilities for germanium is estimated 2 to be b = 1.5. From 
this we can determine directly the ratio of n' (single prime for the 2.4- 
ohm sample) to n" (double prime for the intrinsic sample) as follows: For 
the intrinsic sample p" = n" , almost by definition. Therefore, its con- 
ductivity is 

a" = e(n",x n + p",x p ) = <? Mp (l + b)n". (1) 

For the 2.4-ohm sample, p is negligible (as we shall verify below) and 

<j' = e\i n n' = ey. p bn . (2) 

Dividing the second by the first gives 

J_ _ 48 bn'_ 

a" ~ 2.4 ~ ~ (1 +~b)n 
whence 



// - o , - 20 = 7^-1^77 (3) 



i-m»-»- 



e J - 5 . (4) 



1 Since this section was written, better germanium specimens have become available and 
a new estimate of 60 ohm-cm is considered best. The values of /t n and ju p have also been re- 
vised as a result of Haynes' experiments. It has not been practical to revise all examples in 
the text accordingly, and this section, together with Chapter 16, is now somewhat inaccurate. 
They are internally consistent, however, and illustrate the principles involved. 

2 See Section 12.9 for a review of the best available data. 
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From equation (11) of Section 10.3, which states that 
n = JV c exp[-(S c - & F )/kT\y 



(5) 



we see at once that G>f must be 3.5kT higher for the 2.4-ohm-cm sample. 
Since, for the intrinsic sample, &f is midway between the bands, 6>f for 
the 2.4-ohm sample must be 3.5kT or 0.09 electron volt above the mid- 
point. This situation is summarized in Figure 10.7 which also shows a 
possible impurity distribution. The same situation (except for small dif- 
ferences in rounding off numbers) is represented more analytically in 
Chapter 16, Figure 16.2. 
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Fig. 10-7 — Theoretical Distributions of Quantum States in Energy, Effective Densi- 
ties, and Densities of Electrons and Holes for Germanium. (The values shown are 

illustrative and not exact.) 

It should be noted that n'p' = n"p" in Figure 10.7. This is a direct 
consequence of equation (19) of the last section. 

A plot for 2.4-ohm-cm silicon would look much the same as Figure 10.7, 
and the Fermi level would come at about the same distance (about 0.27 
electron volt) from the conduction band. However, since the energy gap 
for silicon is 1.12 volts, the Fermi level would lie 0.29 electron volt above 
the midpoint, and the number of holes would be very small. 

The changes from w-type to p-type are obvious. 

At room temperature, diamond is an insulator. Its conductivity is, 
therefore, 10 — 13 ohm -1 cm -1 or less. The mobility of electrons and holes 
is probably about one tenth as much for diamond as for germanium, hence 
the density of carriers will be only about IO 3 cm -3 . Taking N c as 10 19 t 
this means that f = IO -17 and, consequently, &f is not within 1 electron 
volt of either the conduction or valence-bond band. This energy difference 
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of 1 electron volt is larger than that predicted for the binding energy of 
donor levels (0.35 electron volt) and suggests that there are levels of other 
sorts lying at least 1 electron volt away from the edges of the energy gap. 
Further evidence for such levels is furnished by data on the ultraviolet 
absorption of diamond which varies from sample to sample from around 
7 to 4 electron volts, indicating levels lying several volts from the edges of 
the energy gap. Further evidence for such deep lying levels has been 
obtained from studies of the release of trapped electrons and holes in 
experiments on electron bombardment induced conductivity. 3 

Applications of the statistical methods discussed here to cases in which 
the distribution of holes and electrons vary from one part of the semi- 
conductor to another have been carried out in various connections. Some 
of the basic equations required are discussed in Section 12.4. Some of the 
applications of particular interest apply to metal semiconductor contacts 
and are presented in the references quoted in Section 4.3. The distribu- 
tion of potential and its dependence on temperature for a semiconductor 
with surface states has been treated 4 and similar calculations have been 
carried out in some detail for p-n junctions. 5 An excellent review of the 
problem with emphasis on metal-semiconductor contacts is contained in 
a paper by Slater. 8 

Problems 

1. The equilibrium condition for temperatures below the intrinsic range 
may be regarded as resulting from the "pseudo-chemical" reaction 

excess electron + ionized donor <=± neutral donor. 

The three concentrations involved are respectively n, N d — n d , and n d . 
The mass action law would then give 

n(N d - n d ) 

Derive this equation employing the approximation (10) of Section 10.3 
and show that 

K = N c exp [($ d - & c )/kT]. 

(This result corresponds to equations (22) and (23) for G. L. Pearson and 
J. Bardeen, Phys. Rev., 865-883 (1949), except that our value of K is twice 
theirs. The difference arises from the fact that an electron trapped on a 
donor may have either plus or minus spin and, therefore, the correct expres- 
sion for n d is N d f[(S d - kTln 2 - & F )/kT]; see Problem 2, Chapter 16.) 

3 K. G. McKay, Phys. Rev. 77, 816-825 (1950). 

4 J. j. Markham and P. H. Miller, Phys. Rev. 75, 959-967 (1949). 

5 W. Shockley, Bell Syst. Tech. J. 28, 435-489 (1949), references. 

6 J. C. Slater, Phys. Rev. 76, 1592-1601 (1949), references. 
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If the mass action law is combined with the condition for electrical 
neutrality, show that 

»(« + N a ) = K 
(N d -N a -n) 
and that 

(n + K)(n + N a ) = N d K. 

This quadratic may be solved for n so that n may be plotted as a function 
of T. This formula is used by Pearson and Bardeen to fit Figure 1.12; 
they use N d = 12 X 10 16 , N a = 1.5 X 10 16 , & c - S d = 0.045 electron 
volt, m n = 0.33 free electron mass and the factor of (^), mentioned above, 
in K. Calculate some values of n from this formula and compare with 
Figure 1.12. 

Refer to Chapter 16, equations (10) and (11) of Section 16.4, and com- 
pute the values of p shown in Figure 1.12. 

2. Obtain the formula for JV C or N v by integrating f directly overP-space. 
See Section 16.2 for a hint. What is the significance of the f power in 
equation (12) of Section 10.3? 

3. Two groups of quantum states N\ and N 2 in number at energies 6>i 
and §2 are occupied by »j and n 2 electrons. Suppose electrons in Ni can 
make transitions to any vacant state in A^ 2 with transition probability T i2 
per unit time. The rate of transitions from Ni to N 2 is then 

niT 12 (N 2 - n 2 ). 

From general statistical theory, the transition probability T 2X from 2 
to 1 is 

T — T «(S2-6i)/*r 
^21 — J 126 

Show that if w x and n 2 are given by the Fermi-Dirac distribution, then the 
rate of transition from 1 to 2 is just balanced by that from 2 to 1: 

n 2 T 2l (N x -m). 

Show that if a third group of quantum states, described by N 3 , & 3 , and 
«3, is in equilibrium with 5 1, it must also be occupied in accordance with the 
same Fermi-Dirac distribution. 



CHAPTER 11 

MATHEMATICAL THEORY OF CONDUCTIVITY 
AND HALL EFFECT 

11.1 INTRODUCTION 

In this chapter, the basic formulae for mobility and Hall effect are 
derived. The procedure is divided into two parts. In the first part it is 
shown that any disturbance of the distribution of electrons from the equi- 
librium value tends to decay, owing to the "scattering" of electrons, and the 
effect of this decay upon the current which may be present is evaluated. 
In the second part, expressions are derived for the rate at which current 
builds up due to the application of electric and magnetic fields. Since only 
a linear theory is desired, the rate of build-up due to an electric field can be 
calculated from the undistorted or equilibrium distribution, the effect of 
the electric field upon the distorted part of distribution being quadratic in 
the electric field. For the effect of a magnetic field, however, it is necessary 
to consider disturbances produced by the electric field in order to get any 
effects. The steady-state solution is then found by equating the rate of 
build-up of current to the rate of decay. The problem is more complicated 
than that of Chapter 8 because the rate of decay is not the same for all 
classes of electrons, and suitable averaging processes must be carried out. 

In Sections 11.2 and 11.3 the decay process is considered. Section 11.2 
is concerned with a general formulation of the decay process and with the 
derivation of formulae useful for application to specific cases. In this 
treatment it is shown that the rate of decay is not affected by the fact that 
the Pauli exclusion principle prevents electrons from making transitions to 
already occupied states. In this section, reference is made to parts of 
Chapter 17 in which the quantum-mechanical theory of transition prob- 
abilities is given in detail. 

In Section 11.3 we treat the two most important transition-producing 
mechanisms: scattering by charged ions, similar to the Rutherford scatter- 
ing of a-particles by atoms which first gave evidence of the nuclear struc- 
ture of atoms, and scattering by thermal vibrations of the atoms. The 
first mechanism is treated on the customary basis of an approximate model. 
The thermal scattering, which is treated in detail in Chapter 17, is given a 
brief and simplified treatment in Section 11.3 which serves to illustrate the 
principles involved. 

2S0 
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In Section 11.4, we treat the effect of the combined interaction of the 
applied fields and the decay process. The effect of an electric field upon 
the equilibrium distribution is evaluated and combined with the laws of 
decay derived in Sections 11.2 and 11.3, and the steady-state distribution 
is determined. The effect of a magnetic field upon the steady state due to 
the electric field is next considered, and a modified steady-state distribu- 
tion is determined. From these steady-state distributions, general expres- 
sions for mobility, conductivity, and Hall effect are found. These general 
expressions are worked out in detail for a number of cases of particular 
importance in semiconductors. 

The formulae for intrinsic semiconductors, in which holes and electrons 
carry comparable portions of the current, are given in Chapter 8 in terms 
of the mobilities of holes and electrons. These formulae are repeated in 
the notation of this chapter in Section 11.4. 

In Section 11.5a comparison between theory and experiment is presented. 

11.2 THE RELAXATION TIME 

In Chapter 8 we gave a simplified treatment of the scattering process 
based on two assumptions : 

(1) The probability that an electron (or hole) makes a transition in any 
small interval dt of time dt/r where t is a constant. 

(2) The probability of a transition to any particular end state is inde- 
pendent of the initial state and is directly proportional to the probability 
that the end state would normally be occupied in the thermal-equilibrium 
distribution. 

In Chapter 17, an analytical examination of these assumptions is carried 
out on the basis of Schroedinger's equation. It is found that, for cubic 
semiconductors such as silicon and germanium and the customary assump- 
tion of spherical energy surfaces, the probability of transition depends 
neither on the direction of motion nor the time after the last transition 
but, in general, does depend on the speed of motion. Thus t in assump- 
tion (1) must be replaced by t(v) where v = \v\ is the speed, or by t(€) 
where & is the energy. 

It is also found that assumption (2) is in error and that the electron can 
gain or lose only a small fraction of its energy in a given collision. These 
gains and losses are important in converting to heat the electrical power 
delivered to the electrons in the form of I 2 R or V 2 JR. However, these 
effects are quadratic in the field and can be disregarded in developing the 
linear theory of mobility and Hall effect. The reason that the change in 
energy is small is discussed in detail in Section 17.6 in connection with 
equation (31). In brief, the electron can gain or lose in one transition an 
amount of energy hv where v is the frequency of the atomic vibration with 
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which it interacts. Such a quantum of energy is called a phonon. The 
vibrations with which the electron interacts are those with wave lengths 
comparable to the electron's wave length. This condition leads to the 
relationship 

mv — h/\ for the electron, (1) 

v = c/\ or hv = ck/X = mvc for the phonon, (2) 

where c is the speed of sound in the crystal. The ratio of phonon energy 
to electron energy is thus 

2c /v = (10 6 cm/sec)/ (10 7 cm/sec) = KT 1 (3) 



mo i /2 



for typical values at room temperature. Thus in one collision the electron 
may change its energy by about 10 per cent. This change is relatively 
small; furthermore, there will be balancing effects, and electrons falling to 
lower energies will be replaced by other electrons excited to higher energies. 
For these reasons negligible errors will be introduced by making the cus- 
tomary approximation of conductivity theory that energy is conserved in 
the transitions. This result is in disagreement with assumption (2) which 
supposes that the energy after transition is independent of the energy 
before transition. 

Assumption (2) also fails in regard to the distribution of directions after 
collision. As we shall show in Section 11.3a, for scattering by ionized 
donors and acceptors there may be a high degree of correlation between the 
direction of motion before and after collision. We shall derive in this sec- 
tion a general formula which takes into account such correlations and will 
apply it to impurity scattering in later sections. 

Figure 11.1 represents the case with which we shall be chiefly concerned. 
It represents in momentum space two equal energy surfaces separated by a 
small energy difference 8&. An element of volume in the shell between the 
two surfaces may be described by the corresponding surface area 50, which 
has the dimensions of P 2 or (momentum) 2 . (We shall use the symbol 
5 for these infinitesimal areas in this section so as to reserve d for changes 
occurring during a time interval dt.) An electron in a state Pi may make 
transitions to other states of almost exactly equal energy. As a result of 
these transitions, the electrons (more exactly, the states occupied by 
electrons) become uniformly distributed in the shell so that the probability 
that any given state be occupied by an electron becomes the same for all 
states and acquires the value f given by the Fermi-Dirac distribution func- 
tion of Chapter 10, evaluated for the energy of the shell considered. 

The number of states in the range S&8 U{ is obtained as follows: The 
volume in momentum space of the element is dtt times the shell thickness, 
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dP x . Evidently, 



THE RELAXATION TIME 



5S = M(P)|,5Px = VidP ± , 
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(4) 

the last equality following from the group-velocity formula. Hence the 
volume of the element is SP ± 5Qi = 5&S£2,-/i>j. The density of states in 
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CRYSTAL MOMENTUM, P x > 

Fig. 11-1 — Notation Used for Describing Transitions from States in Range 5850; to 
States in Range 5850i. The Change in Direction, 8, Is Useful for Spherical Energy 

Surfaces. 

P-space is 2F//i 3 where /^is the volume of the crystal, as for equation (12) 
of Section 5.5. Hence the number of states Ssi in the element is 

dsi = (2F/h 3 )5&5Qi/vi = 2 Pi 5S5fi i 



(5) 



where 



Pi = F/h\ (6) 

is the density of states of each spin per unit energy per unit surface area. 
For the case in which the band is nearly empty, so that the effect of the 
Pauli exclusion principle may be neglected, the transitions may be described 
in terms of a probability parameter Wij. This parameter is used to specify 
the total probability of transition from a state i to all of the states j lying 
in a region dQj of the energy shell. In these transitions there may be 
energy losses or gains. For reasons discussed above we neglect these 
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changes in energy and assume that all the states to which state P* can 
make transitions lie in the shell 5& so that we may speak of transition 
probability per unit area of the shell. Accordingly the probability that an 
electron makes a transition from a state near Pi to a state in range 5Qy 
near Pj in time dt is 

WifiQjdt. (7) 

The total number of transitions of this sort is obtained simply by mul- 
tiplying this probability by the number of electrons in the element dQ,n 
and this number is simply dsi times the probability f,- that these states are 
occupied. Hence the total rate of transition from dSdQi to d&dUj is 

(8&5Q l -> 8&8Slj) = UsiiVijSty (8) 

= f i 2 Pl ff r l jd&Sn i SQ j . (9) 

Similarly the transitions from^' to i will be given by 

(S&SQj -» 8&8Qi) = fjlpjWjiS&SQiSQj. (10) 

These expressions are found to be consistent with the principle of de- 
tailed balancing 1 which states that each process and its reverse occur with 
equal frequency under equilibrium conditions. For thermal equilibrium 
f; = ij = i(S), the Fermi-Dirac distribution function for the energy 5 of 
the shell. Furthermore, as discussed in Section 17.2, the quantum- 
mechanical formulae for JVa and W$i are such that 

pWh = PjWji\ (ii) 

hence the two rates previously evaluated are equal. We shall make use of 
this result later in this section in order to determine the effect of the Pauli 
principle upon the relaxation process when a large fraction of the states are 
occupied. 

The total probability of transition from the state Pi is simply the sum of 
all the transition probabilities to all areas dilj. We shall write it as Wi 
where 

Wi = £ WijdSl) = 1/t c . (12) 

This probability is essentially the reciprocal of the mean free time between 
collisions, denoted by t c since, by the arguments of Chapter 8, the probabil- 
ity of collision should be dt/r c and this is evidently equal to Widt. Since 
assumption (2) is invalid, however, t c cannot be related to the rate of 
decay of a current, and a new formula must be derived for the relaxation 
time. 

We shall next consider the law of decay for any current produced by a 

1 P. W. Bridgman, Phys. Rev., 31, 101 (1928). 
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disturbance of the equilibrium distribution. We shall deal first with the 
case of principal interest, in which the energy surfaces are taken as spheres 
and all directions of motion as equivalent. We shall later mention briefly 
another case in which t c depends on direction. For the spherical case, it 
is evident from symmetry that for the equilibrium condition there is no 
net current. We shall next suppose that we are dealing with a perturbed 
situation. Let us then consider the current due to the group of electrons 
hrii in an element of volume dSdQi. These will produce a current density 

5/ = {-eVilV)hni. (13) 

In time dt, a fraction WijdSljdt will be scattered into a new direction with 
velocity v, so that the change in current will be 

dhl = dt f [-e(Vj - v l )/V]&n i W l jdQ i (14) 

where J2(S) is the energy surface. For the cases of particular interest, 
the transitions are of such a nature that Wij is a function W(8) only of 0, 
the angle between initial and final velocities. Under these conditions, we 
may choose a set of spherical coordinates with = parallel to t>;. In 
terms of this coordinate system, we may write 

Vi = i>; (15a) 

Vj = (i z cos + i x sin 8 cos <p + i y sin sin <p)vi (15b) 

dQj = (m n Vi) 2 sin 8 ddd<p (15c) 

where i x , i y , i z are mutually orthogonal unit vectors. With the aid of these 
we find 

dhl = dtbni(-e/V)i z Vi C* (cos 8 - \W(8)(m n Vi) 2 sin M»2x (16) 

since the components along i x and i v integrate to zero. This expression 
may be rewritten as 

dbl = -dtSI T(l - cos^)^(%)^y 

= -dtiW (\ - cos 8) = -dtSl/r (17) 

where 

^o =fff / (8)dil j = 1/t„ (18) 

(1 - cos0> = (1//F )/*(1 - cos 8W(8)dQ. (19) 

These equations lead, as we shall shortly show, to the result that the current 
5/ decays with a "relaxation time" equal to t = t c /(1 — cos 8) where 



and 
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(1 — cos 6) is the average value of 1 — cos 6 for all the collisions. It may- 
be helpful to note that, if collisions were equally likely to all areas of the 
sphere, then (cos 6) would be zero so that t c and T would be equal. On the 
other hand, if all transitions involved only small changes in direction, 
(cos 0) would be close to unity and relaxation time t would be much larger 
than t c . 

In order to relate these equations to the relaxation of the current, we 
shall consider the current due to an arbitrary distribution f; over the energy 
shell, supposing still that fj<l. Each element of area 50; will then 
produce a current 5/,-, and in time dt the electrons in these elements will 
suffer transitions such that 

dhh = -Unit It. (20) 

The change in the total current / = YJ^li will then change by 

dl = Uddli = -(ZSIi)dt/T = -Idt/T, (21) 

or 

/ = -l/t. (22) 

This shows that the total current decays according to a simple relaxation 
equation for which the solution is 

J = 7 e-' /T . (23) 

Thus t is the relaxation time for the decay of the current and is, as we shall 
see, the correct quantity to use in calculating the mobility. We shall refer 
to it as the relaxation time or the mean free time. 

For the case of thermal vibrations, the transition probability is inde- 
pendent of and (cos 0) = so that t = t c . For impurity scattering, 
(cos 0) approaches unity for large energies so that t 3> t c ; for this case t c 
enters the theory only as an intermediate step. 

Another model, which we mention for completeness, has been consid- 
ered by various writers. For this model, t c is a function of P{ and 
(cos 0) = 0, for spherical energy surfaces. For more complicated sur- 
faces, it is supposed that scattering is of such a nature that after one col- 
lision an electron is equally likely to be scattered into any state in the 
energy shell. The rate of decay of current in this case is given by the 
formula 

/=-£«/</■*, (24) 

summed over all the elements of the energy shell, where Tj is the t c appro- 
priate to the state nearPj. 

The Effect of the Pauli Exclusion Principle. If a large fraction of the 
states in the energy shell are occupied, some electron transitions will be 
prevented because the end states will already be occupied. This preven- 
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tion may be taken account of very simply, and the surprising result is 
obtained that the relaxation process is not affected by having an appreciable 
proportion of the states occupied. 

The simplest way of seeing this result is as follows: If a transition from 
a state P* to a state Pj is prevented because Pj is occupied, then at the same 
time a transition from Pj to P, is also prevented. Suppose we disregard 
the restrictions imposed by the Pauli exclusion principle and permit both 
transitions to occur; then on the average the excluded collisions will 
cancel out in pairs and the relaxation process will proceed at the same rate 
as it would if the exclusion principle were enforced. Thus the relaxation 
of the current will proceed at the same rate whether the Pauli exclusion 
principle is enforced or not and, consequently, the relaxation time for a 
given energy shell will be unaffected by the number of electrons occupying 
states in the shell. We shall verify this conclusion analytically below. 

We consider once more the transitions from S&SQi to 8&8Qj. Since a 
fraction ij of the states in 8&8Qj are occupied, the number of possible transi- 
tions is reduced by a factor (1 — ij) and we obtain 

(d&d£li -* d&dSlj) = f;(l - ij)2 9i W %i d^d%dQj. (25) 

(In Section 17.7 we examine this problem from the point of view of anti- 
symmetrical electron wave functions and conclude that the semi-intuitive 
introduction of the factor (1 — ij) can be justified by a rigorous pro- 
cedure.) The rate of transition from 8&8Qj to 8&8Qi will be given by a 
similar expression with f,(l — f,-). As pointed out above, the quantum- 
mechanical treatment leads to the principle of detailed balancing by having 

PiWij = pjWh (26) 

so that the net rate of transition from StbSQi to 8&8Q, is 

(S&SQi^S&SQj) = [f,-(l -ij) -ij{\ - f i )]2pJP' ij B&dQ t *a j 

= (fj - i^lpiWijbSm^Qj. (27) 

Thus the net rate depends only on f; - ij and not on f; and ij individually. 
From this it is evident that, if the equilibrium represented by f is disturbed 
so that f; = f + f j,-, then the rate of transitions will depend only on fi;. 
Hence the rate of decay of a disturbance is unaffected by the degree to 
which the states are occupied and, consequently, the relaxation time is 
unaltered. 

In a semiconductor, in which the number of electrons varies with vary- 
ing impurity content, the change in number of electrons will be accom- 
panied by changes in the scattering mechanism. For such conditions, of 
course, the relaxation time will depend upon the impurity concentration 
and it will be impractical to change the electron concentration without 
changing the relaxation time. This does not invalidate the result obtained 
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previously in the form in which we wish to employ it. For this purpose 
we need the result that, for a given set of values of Wi^ the relaxation time 
does not depend on f . 

Similarity of Holes and Electrons. The similarity of the behavior of 
holes and electrons follows at once from the result that the rate of electron 
transitions between SSSOi and 8&5Qj depends on f,- — f,-. Evidently each 
electron transition may be described equally well as a hole transition in 
the opposite direction. Furthermore, the fraction of the states occupied 
by holes are 1 — f,- = f pl - and 1 — f,- = f p] : Consequently, the rate of 
hole transitions is 

(SSSUi±^8&8Qj) p = -(S&SQii^d&dQj) 

= (f pi - f pj )2 Pi W lj 8&8Q l 8Q j . (28) 

This last form is formally identical, except for the subscript p, to the 
expression for electrons. Thus we conclude that the relaxation of a given 
energy shell is the same whether it is occupied by only a few electrons, or 
only a few holes, or is partially filled. 

In Section 17.7, we deal with the case of a few holes in more detail and 
evaluate the transition probabilities directly. The conclusion is the same 
as that presented in this section, that the holes may be dealt with formally 
on the same basis as electrons. It is, furthermore, shown that their 
transition probabilities will be of the same order of magnitude. 

11.3 TRANSITIONS DUE TO THERMAL AGITATION AND 
DUE TO SCATTERING BY IMPURITIES 

In this section we shall treat the transitions produced by thermal vibra- 
tions and by ionized impurities in the lattice. 1 We shall treat the latter 
case first as it is conceptually somewhat simpler. For it we shall use a 
classical model and compute the scattering in terms of the electrostatic 
deflection of an electron as it moves by a charged ion. The problem of 
transitions induced by thermal vibrations of the atoms is essentially a wave- 
interference phenomenon and is treated as such from the outset. For 
both cases, expressions for t(v) are obtained for use in the next section in 
formulae for mobility and Hall effect. 

11.3a. Scattering by Ions; the Conwell-Weisskopf Formula. The prob- 
lem of scattering by charged donors or acceptors or other impurity ions 
may be dealt with by purely classical methods, the holes and electrons 
being treated as wave-packets. This procedure permits us to calculate the 
scattering on the basis of the deflections of charged particles moving in 

1 For a discussion in somewhat simpler terms with more application to metals, see V. YVekfr- 
kopf, American Journal of Physics 11, 1-12 (1943). 
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field of charged ions. It is necessary, however, to introduce simplifying 
assumptions in order to obtain the results in analytic form. The results 
obtained classically are a good approximation to the quantum-mechanical 
ones and, for the case of scattering by an isolated ion, the scattering 
formulae are identical. 
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Fig. 1 1-2 — Simplified Representation of the Scattering of an Electron by a Donor Ion. 



We shall discuss first the model shown in Figure 11.2, which represents 
an K-type semiconductor with a density Na of fully ionized donors. The 
deflection of an electron passing by one of the donors is shown. The 
model has been simplified by placing the donors on a simple cubic lattice 
with lattice constant «/. Evidently 



N d = l/«j 3 



(1) 



Furthermore, the other electrons have been uniformly spread about around 
all the other donors so that they are effectively shielded and the electron 
sees only the field of one donor. We can then consider that its deflection 
arises solely from the single ion in the center of the cell. Actually, the 
electron will not be perfectly shielded from the other ions and will be 
deflected to some extent by forces coming from other cells. For purposes 
of calculation we shall define the passage through each cell as a colli- 
sion and will, furthermore, take the average time between collisions as 
T c = aj/v. Even so the problem is still too difficult and we shall make 
the further approximation that the deflection 6 produced by passing through 
one cell is the same as it would be if the electron traversed an infinite 
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hyperbolic path in the presence of the same charge. This last approxima- 
tion leads us to overestimate the deflection due to the central ion and tends 
to compensate for the neglect of the effect of the other ions. 

Before the analytical consequences of the model are derived, it may be 
worth while to estimate the order of magnitude of the effects for a typical 
case: w-type germanium, with a density Nd = 10 15 cm -3 ; the value of 
ai is 1CT 5 cm or 10 3 A (angstroms), and k = 19. la The potential energy at 
a distance ai/2 is 

_ e 2 4.80 X 10- 10 , 

*>i = ~i — 7^ = e 77T, — , n \ -i^-s electrostatic volts 
K(ai/2) I9(ai/2) A W 8 

0.76 , 30 

electron volts = - — — - £Tat300°K 



(fli/2) A ( aj /2) A 

= 0.06kT at 3O0°K, (2) 

the intermediate expressions being included so that energies in electron 
volts and units of kT may be easily estimated for distances in angstroms. 
This calculation shows that, when the electron is at the center of the face 
of one of the cubes, its interaction with the ion is only T *g- of the thermal 
energy kT. Thus the interactions are relatively weak and the deflections 
will be large only for paths which carry the electrons into a radius of 
about (yg-) of aj/2. On the other hand, at low temperatures, such as 
20°K for example, the energy at the edge of the cell will be comparable to 
kT, and an electron which happens to lose energy near an ion may move 
around it in an elliptical orbit before escaping or becoming bound. For 
these low temperature ranges, the present theory will be a rather poor 
approximation. However, it is the best available at the time of writing 
and improvements will probably await the acquiring of better experi- 
mental data than is now available. There is an interesting range, however, 
at which the approximation is valid and at which impurity scattering at 
the same time is the dominant process. We shall discuss the comparison 
between theory and experiment in this range in later sections of this 
chapter. 

We next consider the deflection of an electron which starts at infinity 
with velocity o on a line such that it would miss the ion by a distance b. 
Figure 11.3 shows its hyperbolic trajectory. The diagram also shows the 
trajectory of a hole starting with the same velocity and also directed to 
miss by b. It is a consequence of the equations for the orbits, as we shall 
show, that, if the effective masses are equal, so are the deflections 6 in spite 
of the difference in the sign of the charges. The equation for an orbit in 

la Since this section was written, a revised value of 16 has been obtained by H. B. Briggs. 
See Chapter 9. 
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a central electrostatic field is given in terms of the radius r and angle <f> by 
the equation 

1 e 2f n 

- = ± — J + A cos <t> (3) 

where the + sign holds for attractive forces (electron in Figure 11.3) and 
the — sign for repulsive force (hole). / is the angular momentum which 
may be written as 

/ = mr 2 4> — xp v — yp x = mbv. (4) 




Fig. 11-3 — Positive and Negative Particles of Equal Masses and Energies are De- 
flected through the Same Angle 6 in Traversing Infinite Hyperbolic Orbits., 

The last equation follows from the fact that the angular momentum is 
conserved and has the magnitude mbv when the particles are moving on 
the asymptotes ; since only J 2 enters the formula, the algebraic sign of / 
has been disregarded. 

Our problem is to evaluate in terms of b. It is evident that 
6 = 24>n — it and 6 = t — 74> p for the two cases where <$> n and 4> p are the 
angles of the asymptotes. The values of 4> n and <j> p are found as follows : 
As the particles approach infinity, # approaches its maximum magnitude 
and we have 



= —5- + A cos <t> n ; = 
K 7 



e 2 m 



r2 4- A cos (pp. 
K J 



Near the maximum, the right side may be expanded as a series in <j> 
— (j> p , leading to 



1 



• (^ _ 4>n)A sin <j) n ; 



1 



— (<t> — <t>p)A sin 4> p . 



(5) 



0«or 



(6) 
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These last may be solved for (<j> n — <f>)r and (<t> p — <t>)r, quantities which 
are evidently both equal to b for large values of r. This gives 

b — \/A sin 4> n , b = 1/ 'A sin 4> p . (7) 

A may now be eliminated from the equations giving 

e 2 mb e 2 mb 

COt (f>„ = =5- ' COt (j) p = — 75- 

kJ K J 

(8) 



e 2 



Ktnv 2 b nmv 2 b 

For tPb = 00 , cot $„ and cot <f> p are zero corresponding to 4> n — <t>p = t/2 
and to 6 = and thus to no deflection. For finite values of v 2 b, <£„ and 
(j> p differ from v/2 by equal amounts with 4> n > x/2 and <t> p < ir/2 thus 
leading to equal deflections 8. The relationship between 8 and <t> n and <£p 
leads for both cases to 

tan 0/2 = e 2 /Kmv 2 b. (9) 

In order for an electron to be deflected from its initial direction through 
an angle between 8 and 8 + dd, corresponding to a solid angle 2x sin ddd, 
it must be directed towards an annular ring lying in a range b to b + db 
where 

e 2 8 e 2 

db = 5 </ cot - = - 2 cosec2 (*/2)<#. (10) 

Kmv 2 Ixmv 

The projected area 2irbdb of this annular ring at which the hyperbolic 
asymptote must be pointed in order to produce deflections in the solid angle 
27r sin 8d6 is defined as c(8)2ir sin ddd, where <r{8) is called the differential 
cross section. From this definition we obtain 

This is the Rutherford scattering law. The same relationship is also 
obtained by solving and interpreting Schroedinger's equation. 2 

The formula indicates that for small angles of scattering, a approaches 
infinity; this corresponds to the fact that there is an infinite cross section 
for large misses which produce negligible deflections. For 8 approaching ir, 
corresponding to reversal of motion, a = r T 2 /16 where r ff is the radius at 
which the potential energy equals mv 2 /2. 

For our application, the quantity of particular importance is 1 — cos 8 

2 See, for example, L. I. Schiff, Quantum Mechanics, McGraw-Hill Book Co., New York, 
1949, equation (20.11). 
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and we have 

1 — cos 6 = 1 — cos (2<t> n — x) = 1 + cos 2<p n = 2 cos 2 <j>„ 



l + tan 2 4> n /~»2*\s 






(12) 



which expresses the quantity 1 — cos 8 in terms of b and v. The formula 
applies to both holes and electrons. 

We must next average 1 — cos 8 over all collisions. To do this we give 
each "radius of miss" i a weight proportional to its projected area and for 
simplicity we replace the square of edge aj by a circle of radius aj/2. We 
then say that the probability that the asymptote is directed so as to pass 
in a range db is simply (2iridi)/ir{ai/2) 2 . The value of (1 — cos 8) then 
becomes 



4 r»i/2 
(1 - cos 8) = 5 / (1 



cos 8)2iridb 



where 



-(§H' + (!) 2 ] 

2e 2 

S = mv 2 /2, &x = (14) 

Kai 

For an electron with & = kT'm the example discussed with equation (2) 
<1 - cos 8) = 2(^) 2 In [1 + (32) 2 ] = &. (15) 

Thus an electron is in effect only g^ scattered in traveling a distance aj so 
that the mean free path for the example chosen will be 65«j or 6.5 X 10~ 4 cm. 
From formula (17) of Section 8.8 we see that this mean free path is about 
40 times the mean free path of a thermal electron. At room temperature, 
therefore, impurity scattering is unimportant in semiconductors with low 
impurity contents. 

In terms of t c and (1 — cos 8} the relaxation time n for impurity 
scattering is found to be 

t/ = T.A1 - cos 8) = 1 0Q 8 — l -^T^ ■ (16) 



<"[' + (I/] 



This formula was introduced by Conwell and Weisskopf 3 in calculating the 

3 E. Conwell and V. F. Weisskopf, Phys. Rev. 69, 258A (1946); 77, 388-390 (1950). The 
problem is mathematically identical with that of the scattering of electrons in an ionized gas 
which has been solved to the same approximation by S. Chapman, Monthly Notices, R. A. S. 
82, 292-297 (1922). 
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effect of impurity scattering upon the mobility. Owing to differences in 
the approximations their expression differs by a factor 4/jr leading to 

_ la! (2S/S f ) 2 
Tl vr In [1 + (2S/S/) 2 ] " U ; 

We shall return to this formula in the next section and average it appro- 
priately over the energy distribution, thus obtaining the Conwell-Weisskopf 
formula for mobility. 

If several types of impurity are present, with charges varying both in 
sign and magnitude, substantially the same approximations may be carried 
out. For this case, ai should be computed from the total density of 
scattering centers without regard to sign or magnitude so as to divide the 
crystal in this way into one cube for each center. The contribution of each 
center, however, is approximately proportional to &i 2 as may be seen from 
the formula for (1 — cos 6). Since |d>/| is proportional to e 2 /n for single 
charged ions, &i 2 will be proportional to Z 2 for an ion of charge Z e . Thus 
the average charge to use in the formula will be approximately the root 
mean square charge of the scattering ions and the number to use will be 
the total number. 

There is strong theoretical and experimental evidence that donors and 
acceptors contribute in an important way to scattering even when their 
charges are neutralized by bound holes and electrons. This possibility is 
discussed by Pearson and Bardeen 4 who conclude that in silicon equal 
contributions will be made by an ionized and a neutral center at about 
100°K. At lower temperatures, the ionized center may be more effective. 
The scattering is associated with exchange effects between the incident and 
the bound electron and is similar to problems already investigated for the 
collision cross section of hydrogen atoms. 5 The application of these 
important theories to semiconductors is not at present so well developed, 
however, as to justify a discussion of the other forms of impurity scattering 
at this point. 

11.3b. Scattering by Thermal Agitation; the Deformation Potential 
Formula. Thermal vibrations of the atoms distort the periodic potential 
field in which the electrons move. As a consequence, the Bloch wave 
functions are no longer exact solutions, and an electron started with one 
value of P makes transitions to other states and is, in effect, scattered. 
This situation is closely analogous to the reflections and attenuations which 
may occur in a non-uniform but lossless transmission line. At the locations 
where the impedance changes, mismatches of impedance occur and the 

4 G. L. Pearson and J. Bardeen, Phys. Rev. 75, 865-883 (1949). 

5 H. S. W. Massey and C. B. O. Mohr, Proc. Roy. Soc, London 132A, 605-630 (1931); 
136A, 289-311 (1932). 
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wave is reflected. We shall give a treatment in this section based on the 
idea of mismatch and reflection and thus show how the various important 
quantities enter the formula for the mean free path. 

As a result of thermal agitation, the atoms in the crystal are vibrating in 
a random fashion with respect to their neighbors. The description of this 
pattern and a determination of what aspects of it are most important in 
scattering the electrons are treated in detail in Chapter 17. We shall here 
summarize and simplify the physical picture which results from that 
analysis. In the first place, the important distortions of the atomic 
arrangement are those which produce uniform effects over a distance of 
about (j)X for the electron wave. This is a familiar result from wave 
theory. In a material like a glass or a liquid, for example, there is a great 
amount of disorder on a scale of atomic dimensions. A wave length of 
visible light, however, extends over several thousand atomic spacings and 
as a result the effects of the atoms on the waves are averaged over groups 
containing of the order of (1000) 3 atoms. For such large numbers, the 
statistical fluctuations are negligible, the material appears to be homo- 
geneous, and no appreciable scattering of light occurs. On the other 
hand, if the transmission properties of the medium change by only a small 
fraction of themselves in a given wave length, then the situation is essen- 
tially that of a tapered transmission line or a tapered waveguide. For such 
cases the traveling wave can adjust itself as it progresses and almost no 
reflection occurs. In quantum mechanics such cases are the ones best 
adapted to treatment by the Wentzel-Kramers-Brillouin approximation, 
which is discussed in most quantum-mechanics texts. If, however, there 
is a change in property of the material in a region which extends over 
(j)\ there will be a relatively strong reflection: Under these conditions, 
the waves reflected from the entrance side and the exit side will combine 
constructively since the reflection conditions at the two faces give reflec- 
tions differing by 180° and the round trip through the region adds another 
phase lag of 180°, making 360° altogether. 

In the correct quantum-mechanical treatment, the problem of determin- 
ing the important scale for distortions is eliminated by a mathematical 
device. Essentially what is done is to make a Fourier analysis of the 
deformation. Each component corresponds to a possible normal com- 
ponent or standing wave pattern for the atomic vibrations, like the waves 
of Figure 5.3 or 5.6. Each of these vibrations is thermally excited and has, 
at the temperatures of most interest, an average energy of kT. The 
scattering of the electrons is then computed by adding the effects of the 
individual waves. From this it is found that the most important waves 
are those whose wave length is somewhat less than the electrons' wave 
lengths. A longitudinal wave of this sort will produce alternating regions 
of compression and dilation somewhat less than ^X (X for the electron 
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wave) wide, in general agreement with our conclusion that disturbances 
about j\ wide are the most important. 

Furthermore, it should be noted that the speed of a thermal electron is 
about 10 7 cm/sec whereas that of the sound wave is about 5 X 10 5 cm/sec, 
or 20 times less. Consequently the deformation in the crystal may be 
regarded as static while the electron moves by. The relatively slow 
movement of the pattern will, however, produce a "Doppler effect" on the 
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atomic spacing produced by a longitudinal wave has the effect of introducing a 

varying potential energy.) 



scattered electron wave and thus change the energy of the electron. As 
discussed in Section 11.2, this effect is small and can be neglected for our 
purposes. (See Problems for a further discussion.) 

We must next ask what effect a deformation of the lattice will have upon 
its transmission properties for an electron wave. The answer is found by 
considering the dependence of the energy boundaries upon lattice constant 
as illustrated in Figure 11.4. Here a one-dimensional section of the crystal 
is shown. In it, a longitudinal wave progressing in the ^-direction is 
represented, for purposes of illustration, as changing the degree of compres- 
sion abruptly so that the crystal is divided, momentarily, into three parts 
having lattice constants along the #-axis of a\, «o> and a 2 respectively. We 
now imagine that this pattern is frozen and consider its effect on the motion 
of an electron. Part (c) shows the dependence of the energy-band bound- 
aries upon a x , the lattice constant along the A;-axis, it being supposed that 
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a v = «z = «o> where #o is the equilibrium value or true lattice constant. 
(This plot differs from Figures 5.4 and 5.5 for which the variable was 
a = a x = a y — a z .) The changes of lattice constant will thus change the 
energy-band structure in the crystal as shown in (b). The changes in 
energy-band boundaries may be seen to be substantially equivalent to the 
introduction of a varying potential energy for an electron, and we shall 
refer to them as deformation potentials. An electron of energy &', as it 
moves through the crystal, will have variations in its kinetic energy 
(energy in excess of £> c ) as shown. The discontinuities in kinetic energy 
will give rise to reflection coefficients at the transition boundaries which are 
closely analogous to those for a discontinuous transmission line. Entirely 
similar considerations apply to the case of the hole and we need not con- 
sider it separately. 

We shall illustrate the reflection process for an electron by considering a 
simplified one-dimensional problem in which an electron wave in regions with 
kinetic energy (K.E.) a = P a 2 /2m strikes a region b in which its kinetic 
energy is (K.E.)b = Pb 2 /2m and is partially transmitted and partially 
reflected. We shall take the boundary as occurring at x = and shall 
write the wave functions as follows : 

Incident wave: e iPaX/n 1 (18a) 

• D /* f region a, x < 
Reflected wave: Z?e -,p ' ,x/ * J (18b) 

Transmitted wave: Te iPhx/fi region £, x > 0. (18c) 

The electron current density (as the reader may verify from the equations 
of Section 15.4) is proportional to 1 — \R\ 2 so that \R\ 2 is to be taken as 
the fraction of incident current which is reflected. The boundary condi- 
tions at x = require continuity of v|/ and dif/dx and thus lead to 

1 + R=T, (iPJh){\ -R) = (iP b /h)T (19) 

and hence to 

a formula strictly analogous to the reflection formula {Z\ — Z 2 ) 2 /(Zi + Z 2 ) 2 
at the junction of transmission lines of impedance Z\ and Z 2 . (For the 
transmission-line problem, continuity requirements for voltage V and 
current / = ± (1/Z)^ replace those for i|f and ±A|/.) For a small change 
in energy S& c , we may write 

8& c = P5P/m n (21) 

and 

, |2 _ (SPl _ m n 2 (8& c f 
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We must next consider the value of the deformation potential S$ c to be 
expected for a region about \\ in size. In Figure 11.5 we have arbitrarily 
divided the crystal into cubes b\ along the edge having volume (£X) 3 = V§. 
We shall assume initially that each cube contracts and expands inde- 
pendently of the others, considering later how this assumption may be 
corrected. Our problem is then to find the probability that, if the incident 
wave falls on a particular cube, it will be reflected. To estimate this effect, 
we shall re-express 8& c in terms of a change in volume. For a crystal with 
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Fig. 11-5 — For Purposes of Calculation the Crystal Is Divided into Cubes Which 

Expand and Contract Due to Thermal Agitation. Each Block Acts Independently 

in Scattering the Electron Wave. 



cubic symmetry it is evident that 86> c must depend symmetrically upon 
a x> a y , and a z . The only linear symmetric form may be written as 

S&c = (a x + a y + a z — 3a )5>i n /a (23) 

where &i n is a constant for small changes of the as. But this form is 
equivalent to 



8tb c = d> ln — 



(24) 



where V is the original volume of the unit considered and F + hV the 
distorted volume. (For the case of holes we similarly conclude that 

8V 



8(b v — ©ip 



V a 



(25) 



a more complete discussion of both cases being given in Chapter 17.) 
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The volume considered in Figure 11.5 has many degrees of freedom, in 
fact three times as many as atoms, and it contains kTof thermal energy for 
each of these modes. Of all the possible modes of distortion, however, the 
one of most importance is that which produces a dilation (or contraction) 
of the whole unit and thus contributes to the average value of 8& c . All 
other independent modes will give zero average dilation and will correspond 
to fine-scale fluctuations of potential of the sort which we decided to neglect. 
For the deformation of importance the stored elastic energy is 



\pW =±B — 5V (26) 



where p — — BSF/Fq is the pressure and B is the bulk modulus or the 
reciprocal of the compressibility. This energy must, on the average, be 
^kT so that 

This leads to a probability of reflection of 

, , 2 _ m n 2 &m 2 kT _ m n 2 & Xn 2 kT 

11 ~ 4P*r B ~ 4P 4 (^X) 3 J S' { ' 

In terms of \R\ 2 we estimate the mean free path / as follows: By defini- 
tion, the probability that an electron be scattered while moving a distance 
8x is bx/l. In traversing the cube the probability is about 2\R\ 2 , that is, 
\R\ 2 from front and back sides. Thus we have 

h ] = « 
or 

,_^| 8 ,._«^M *»*, (29) 

the last equality following from \P — h. 
The exact expression for / derived in Chapter 17 is 

I - -t^r (30) 

where cu is the elastic constant for a longitudinal wave propagating in the 
[110] direction and is about 25 per cent larger than c\\. The ratio of the 
two values of / is 

/ (approx.) = 2(2tt£) 4 £ ^ 2 
/ (exact) tcu 
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for B = \{cn + 2fi 2 ) — ^Cn and b = \. This agreement is as good as 
could be expected from our simple model, which was discussed for the pur- 
pose of illustrating the principles rather than for calculating the exact 
formula. 

The exact treatment leads to the result, which we shall not attempt to 
show from the simple model, that the scattering is substantially spherical 
so that the direction of motion after collision is independent of the direc- 
tion of motion before collision. This leads, as discussed in Section 11.2, 
to the result that (cos 6) — for this case. In accordance with these 
results we have for this case 

t = t c = l/v and (1 - cos 6) = 1. (32) 

Certain flaws in the model should be mentioned. It is incorrect to treat 
the blocks as independent since the surrounding material exerts a restrain- 
ing force which approximately doubles the stiffness and thus tends to 
increase /. The scattering treatment of one block is so very crude, how- 
ever, that it would not be worthwhile to try to improve the treatment of B 
without adding further refinements. Adding further refinements would 
soon lead to the path followed in Chapter 17, however, to which the reader 
who is interested in the basic theory is here referred. 

In the next section, formula (30) for the mean free path will be used to 
obtain an equation for the mobility, equation (30) of Section 11.4. An 
important test of the theory can be made from values of 6>i„ and the cor- 
responding quantity for holes, and & ip . These values can be determined 
from the measured mobilities, elastic constants, and assumed values of the 
effective mass. The sum of the two values may then be compared with 

&iG = d{&c-&v)/d\nF (33) 

where V is the volume of the crystal and &\q is the change in energy 
gap per unit dilation. In Chapter 12 we shall discuss various ways in 
which §i<? may be determined directly. If §i„ and & ip have opposite 
signs, as shown in Figure 11.4, then the relationship 

ISiel = |Si»|+|Si,| (34) 

should hold. The comparison of § 1G with |S ln | + \&\ v \ presented in Chap- 
ter 12 shows that the theory of scattering by deformation potentials is 
probably correct, at least in its general features. 

11.4 AVERAGE MOBILITIES, MEAN FREE TIMES, AND 
HALL CONSTANTS 

As a result of the analysis in Sections 11.2 and 11.3, we can suppose that 
there is a relaxation time t(§) associated with the states in an energy shell 
bounded by surfaces & and & + d& and that this time is independent of 
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the degree to which the states are occupied. According to the develop- 
ment of the concept of the relaxation time, any current produced by a 
disturbance of the equilibrium among states will decrease by a fraction 
dt/r(S) in time dt. We shall now use this result to calculate the steady- 
state currents. 

The conductivity and other quantities of interest are determined by 
calculating the current due to the electrons in a partially filled Brillouin 
zone when their equilibrium is disturbed by applied fields. A magnetic 
field alone, as discussed in Section 7.4, causes an incompressible flow of 
points in the Brillouin zone along the equal energy surfaces; along a given 
energy surface the density of occupied states, being f (&) times the density 
of states, is constant; consequently, the flow produced by H does not dis- 
turb the equilibrium. An electric field does alter the distribution and the 
non-equilibrium distribution so produced can then be affected by magnetic 
fields in the manner discussed in Chapter 8. 

We shall introduce a symbol f'(P) to represent the fraction of the quan- 
tum states, having crystal momenta near P, which are occupied by elec- 
trons under non-equilibrium conditions. For small fields, f' will differ by 
a very small amount from the Fermi-Dirac distribution function f(P) = 
i[&(P)] and we may write 

f'(P) =f[&(P)]+U(P) (l) 

where fi represents the small disturbance produced by the field. 

We shall use Figure 11.6 in deriving the effect of the applied field. The 
coordinates are P-space and two energy surfaces corresponding to & and 
& + d& are shown. The total number of quantum states in the shell be- 
tween the two surfaces is denoted by dS and the number in any small ele- 
ment by ds. The number of electrons occupying states in ds is f'(P)ds. 
These electrons contribute a current density, evaluated according to 
equation (1) of Section 6.1, of 

dl, = ^ vi'ds, (2) 

since each of them has the velocity v = VpS(P) evaluated at the P cor- 
responding to ds. The total contribution of the shell to the current 
density is 



dl s 



= -} - f vi'ds. (3) 

V ./shell dS 



Under equilibrium conditions, this integral is zero because of the symmetry 
of the energy surfaces: For all Brillouin zones, regardless of the crystal 
structure, &(P) = §(— P); consequently the surfaces have a center of 
symmetry and for each element ds located at P and having velocity v 
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there is always an equal element ds located at — P which has an opposite 
velocity v' = — v and these elements will cancel in the integral. Hence, 
the current arises entirely from f t . 

The disturbance in equilibrium due to the electric field is represented 
by P = — eE on the figure. In time dt, the flow produced by P moves elec- 
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o 
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f=f+f,-~ 




THICKNESS OF SHELL = dP. 

= d6/|v p S| = d&/v 



f'(P-Pdt)= 
f'(P)-Pdt-VpfV) 



dS = TOTAL NUMBER OF 

STATES IN SHELL,d& 



CRYSTAL MOMENTUM, P x 

Fig. 11-6 — Disturbance of Equilibrium by Electric Field. 

trons so as to replace the f (P)ds electrons in ds by the i (P — Pdi)ds elec- 
trons in the volume displaced by —Pdt from ds. This causes the number 
of electrons in ds to change at a rate (due to E) of 



\.at Jduetoi' L"t J 



ds 



due to E 

-[P ■ V P i(P) + P 



-P ■ V P f'(P)ds 
V P h{P)]ds. 



(4) 



The second term in this expression involves a product of P, which is propor- 
tional to E, and fi, which is also proportional to E, and thus represents a 
quadratic effect, which we can neglect in a linear theory. We shall, there- 
fore, calculate the rate of increase of current due to Vpi(P) and shall later 
equate this to the rate of decay of current due to collisions in order to 
determine the steady state. Because f (P) depends only on S(P), we may 
re-express V P f[&(P)] as follows: 

V P f[S(P)) = Vp&(P)df(&)/d& = v-=^ (1 - f)f, (5) 
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where we have used the useful relationship 

^ = jg }[exp(g - S p )/kT\ + I}" 1 = - ^;f(l - f). (6) 

From these we obtain 



(7'') 

\dt / d 



= -P ■ V P i{P) = tj£-E ■ rf (1 - f). (7) 



' due to E 

The rate of change of dl s due to E is evidently 

[j^l =M^7 f 'l *• 

\_dt J due to £ L r J L"< Jdueto£ 

and by using the foregoing relationships, this may be reduced to 



<? 



\i dI °\ = T¥* v {E ' y)f (1 - i)ds - (8) 

Y.dt JduetoE klV 



It should be pointed out that the situation shown in Figure 11.6 has been 
exaggerated for purposes of exposition by making the vector Pdt very 
large. Actually we have used the figure to calculate a derivative and this 
corresponds to the case for which Pdt approaches zero. Under these con- 
ditions the rate of change oii'ds arises from the value off just on the two 
surfaces of the shell. (We have also shown in the figure a formula for the 
thickness of the shell. This is for use in the problems at the end of this 
section.) 

The fact that rate of change of equation (8) depends upon the degree to 
which the states are occupied through the simple factor f (1 — f) has the 
following interpretation: When f is very small, as it is for a high resistance 
w-type semiconductor, f(l —i)ds is approximately ids, which is simply the 
number of electrons in the states ds; the contribution of the states ds is, 
therefore, simply proportional to the number of electrons in them. When 
an appreciable fraction of the states are occupied, however, the efficiency 
is reduced with which they can use their electrons in producing changes in 
current; in particular if the states ds were completely occupied, no change 
in their current could be produced and the effect of the electric field would 
be simply to replace one electron by another. The proper efficiency factor 
is evidently (1 — f) since this gives the probability that an accelerated 
electron will produce a change in the situation rather than simply replac- 
ing another electron. Furthermore, a similar situation applies for holes; 
thus as discussed in Chapter 10, (1 — f) = f p is the probability that a 
state is occupied by a hole; hence the expressions f(l — f) and (1 — f P )f P 
are equal and equation (10) is symmetrical in holes and electrons. The 
same reasoning may be applied to holes, in which case the effectiveness 
factor is (1 — f p ) = f. We shall use the interpretations of ids as number 
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of electrons and (1 — i)ds = f p ds as number of holes in later integrated 
expressions for the current and the conductivity. 

The rate of change of the total current due to the shell d& is obtained 
by integrating (8) over the quantum states in the shell. (The general 
results are more complicated than needed for our purposes: Thus if the 
electric field is in the ^-direction, for example, the change in current may 
have a component in the jy-direction ; it will have such a component if the 
average value of E x v x v has a _y-component. We shall give the general 
formula for such cases in the Problems and restrict the treatment here to 
the symmetrical case of spherical energy surfaces.) For the case of spheri- 
cal energy surfaces, the average value of v x v y over the surface of the sphere 
vanishes by symmetry, and the average values of v x 2 , v y 2 , and v 2 are equal 
by symmetry; furthermore, since v x 2 + v y 2 + v 2 = v 2 , where v = \v\, 
the average value of v x 2 = v 2 /3. For this case integration over the shell 
d& leads to 



v 2 



Jt X Jdueto£ * kTV 3 



f (1 - i)dS (9) 



for the ^-component of current due to a field E x . 

Under steady-state conditions the increase in current in shell d& due to 
E must be just balanced by the rate of decay. This latter is simply 
(dIs)/T(&) so that we have 



-D 



M J due to E L at -i collisions 



dlsx 



+ \-r.dI& 



2 2 1 

= E x -^—/-i{l - i)dS - - dI Sx 

klV 3 T 



leading to 



e 2 



dI Sx = E x -^f^rv'i (1 - i)dS, (10) 



where dls is the contribution of the dS states in shell d& and dls x is its 
^-component. The total current is obtained by integrating this equa- 
tion (10) over all energies. 

11.4a. Non-degenerate Semiconductors. For the case of semiconduc- 
tors, the element of integration can conveniently be changed by using the 
approximations of equations (2) and (4) of Section 10.1: 

f(f- l)dS £ dS J 
w-type: = iy=^„ & (Ha) 

f(f - l)dS . dS , „,, N 

p-type: y±— = i P y = dp & (1 lb) 
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where dn & and dp & are respectively the contributions to the electron and 
hole densities arising from the energy shell d&. 

Before carrying out the integration for current and conductivity for the 
semiconductor case, we shall discuss the Hall angle 8 and obtain a corre- 
sponding expression for it so that both integrations may be carried out at 
the same time. We shall deal with the case of a small magnetic field H z , 
at right angles to E x , and shall neglect any effects proportional to H 2 . 
Under the influence of E x alone the shell d& acquires a current dls- For 
the spherical approximation, this current is parallel to E, its only com- 
ponent being 

dI Sx = E x ^-- r v 2 r(&)dn s . (12) 

The magnetic field rotates the entire pattern of states at rate oi = eH z /mc 
and thus tends to build up a current dlsy at a rate equal to udlsx- This 
component of current, like any component of current produced in shell 
d&, decays at a rate equal to its steady-state value times 1/t and is thus 

dlsy = urdlsx- (13) 

(The effect of H z upon the ^-component of current is quadratic in H z and 
may be neglected.) The total x and y components of current are thus: 



/» = 



E^ 
3kT 

__ E x e 2 u 
y ~ 3kT 



Jv 2 T(&)dn & (14a) 

fv 2 r 2 (&)dn & . (14b) 



The integrals represent the sums of p 2 t and v 2 t 2 over all the electrons in 
unit volume of the crystal and are thus simply n times the average values. 
Denoting the average over all the electrons of the distribution by {v 2 r) 
and (v 2 t 2 ), we may write for the conductivity a and the mobility /* (omitting 
the subscript n for brevity, it being understood that electrons are 
concerned). 

h e(v 2 T) 

a = E x = ne -3kT =ne,X - (15) 

The Hall mobility mf and Hall mean free time th, defined in Section 8.9, 
are 

_ rf _ d^ _ co(eV) _ e(v 2 T 2 ) 
1X11 ~ H z ' HJ X - H z {v\) ~ m( V 2 r) (16a) 

mcd {v 2 t 2 ) 
T H = ~ =717 = 7jOT- ( 16b ) 
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We shall next carry out the necessary averages for (15) and (16) by 
integrating over the electron distribution. The number of electrons hav- 
ing speeds in the range dv about v is given by the approximate expression 
(2) of Section 10.1 for f and by the volume 4irP 2 dP( <x v 2 dv) of momentum 
space. Hence the relative probability that an electron has its velocity in 
the range v to v + dv is 

e-" a VA (17) 

and the average value of any function g(v) is 



X" 



-mv"l2kT n 



J0 'givVdv 

<*(»)> = —7^ (18) 

e-™ !/2fc VA 



Jo 



the denominator serving to normalize the probabilities. If g(v) = v n , the 
integrals become standard forms and we have 

(p») = vr n ^ \J . (19) 

2 • 

where Vt is the thermal velocity (actually root mean square along one 

axis) : 



v T = V2kT/m, (20) 

and x\ = T(x + 1) for non-integral values of x. The integrals with 
which we shall be concerned can be evaluated with the aid of a general 
formula and two particular values : 

(* + 1)! = (* + 1)*!, 1! = 1, ?! = V^r/2. (21) 

We shall first test these formulae by finding the average value of \rriv 1 ; 
this is known to be -f^Tfrom the general equipartion theorem which gives 
\kT to each component of motion; thus we should find that (v 2 ) = 3kT/m. 
Our formula checks this by giving 

. 2 , 2 f! 2kT |-|! 3kT 

(v 2 ) = v T 2 f-= hr~ = 22 

f! m f! m 

Inserting m(v 2 ) for 3&Tin (15), we obtain 

<Kp 2 t) eT„ e 1/t) e 

a = ne — -57 = ne — » n = — —j— = — t„ (23) 

m\v ) m m {v ) m 

where t„ is the appropriate mean free time to use for mobility. We see 
that t,, is a weighted average of the relaxation time over the particles of 
the distribution, each particle being given weight v 2 . It is similarly seen 
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that th is the quotient of the weighted average of t 2 divided by the 
weighted average of t. The Hall-effect formula, derived in Chapter 8, 
can, therefore, be written as 

t„ n{±e)c n{±e)c (vt) 
since our formula shows that th/t, is 

UK-IE- fr^X" 2 ) _ W ■ /W nn 

M T M <trV> 2 <» 2 > • V(^)/ ' ^ 

The second form of equation (25) shows that th/t m is the weighted 
average of t 2 divided by the square of the weighted average of t. Since 
a mean squared value is always greater than the square mean value, as 
is shown in Section 14.3 for a different example, the Hall mobility is always 
greater than the drift mobility. This conclusion depends, however, on 
the assumption of spherical energy surfaces and a mean free time de- 
pendent on energy only. In Section 12.9 we shall quote data which throws 
doubt upon the validity of these simplifying assumptions and shall discuss 
possible explanations. 

We shall next evaluate these averages for several dependencies of t(S) 
on the energy and shall then derive expressions for degenerate cases. 

FORMULAE FOR NON-DEGENERATE SEMICONDUCTORS 

t = constant. The simplest case is that for which the mean free time 
is constant with value Tq. For this case we have 

t m = r H = To (26a) 

m = er /m (26b) 

Rh = T ", = -j^— (26c) 

Thermal Vibrations. The next case is that appropriate to scattering 
by thermal vibrations. For this case the mean free path is constant and 

t = I /v. (27) 

Using the notation ol equation (15) that ( ) denotes an average of the 
bracketed property over all the electrons in the distribution giving each 
electron equal weight, we find 

(h) lv T \\ I 4/ 

T M = — T7 = — 5 — :r, — T = (Zoa) 

{V) v T --f\ 3\^r WlirkT/m 

vt 



4 

4 



„„£*_ 0.753^--^= (28b) 
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m _ x M - </,) 2 -(I!) 2 " 8 " U8 (28c) 

3x 1.18 

* H = M^ = ^^' (28d) 

The formula for / derived on the basis of deformation potentials and 
quoted in Section 11.3b is for electrons 

_ ntfcu 
&i„ kTmn* 

where cu is the elastic constant for [110] longitudinal acoustical waves, S ln 
is the derivative of & c with respect to the logarithm of the volume of the 
unit cell, and m n is the effective mass. Inserting this in the formula for 
H gives 

2V^ h\i % 

Hn = r~ T^~2 W[M 1 • ■ ( 30a ) 

Expressed in cm 2 /volt sec for /*, dynes/cm 2 for en, electron volts for & in 
and °K for T, this becomes 

M = 3.2 X Kr s cii(m/m t L)»/& ln i 1*. (30b) 

Similar expressions apply to holes. 

Impurity Scattering; the Conwell-Weisskopf Formula. For this case 
the dependence of t upon v is given by the complicated relationship (17) 
of Section 11.3, 

W2S\ 2 



T/ 



'"[-(I) 



(31) 



where the symbols are defined in terms of the density Ni of scattering 
impurities and the dielectric constant k as follows : 

ai = Nr H , S 7 = 2*y«a 7 , £ = m n v 2 /2 (32) 

where m n is the effective mass for an electron, n can be rewritten as 

t/ = -t T^FT (33) 



4- (1)1 



The quantity (2S/S/) was found to be about 30 for a typical high resis- 
tivity sample at room temperature so that the logarithm term has a value 
of 6.8. This term varies slowly with S, in fact a change of e ±1 in £ will 
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make the logarithm vary from 4.8 to 8.8. For this reason, no great error 
will be made by replacing the logarithm term by a constant corresponding to 
its value at the most important part of the range while averaging v 2 t. 
Now the average of v 2 t is the same as the average of 2m n &r, and the 
number of electrons in a range dS> is proportional to 6> exp ( — &/kT)d&. 
Thus the integrand in (v 2 t) involves the factors 

(S» from Tj ) (S from v 2 )&^~ &/kT = & 3 e~ &/kT : (34) 

this has its maximum at d> = 3kT. Hence we replace & by 3kT in the 
logarithm term and write Tj in the form 

m n 2 K 2 /2 v N ie 4 



Tl 



= Bv\ B = 



In 



1 + (it) 



(35) 



Since t is now proportional to a power of v, the previously derived formulae 
are applicable and we obtain 1 

TM = ^ = BVT* ~ = 4= B °T 3 i ( 36a ) 

<» ) I ! V x 

8 /2 K 2 (*r)^ 



ln L 1 + WJ 

w _ VL _ WV> _ ill! _ 315x _ ! 93 . (36c) 

M " t, ~ Z?W _ 3! 3! _ 512 ~~ 1 -^' ^ 6CJ 

1 93 
Rh = -^ V - (36d) 

w(±^)c 

Intrinsic or Near-Intrinsic Range. In Chapter 8, the formulae for con- 
ductivity and Hall coefficient for the case in which both holes and electrons 
are present were derived in terms of n and 6. We shall repeat the formulae 
here modified in keeping with the notation of this chapter: 

<r = nen n + pen P (37) 

6 —{nen n eT H nH/m n c) + {.pen p er H pH/m p c) 



-nTunTHn/mn + pT^pT Hp/»lp 



(38) 



ec{mp n /m n + pTy. p /m p ) 

1 The formula for M is due to E. Conwell and V. F. Weisskopf, Phys. Rev. 69, 258A (1946); 
77, 388-390 (1950). 
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If the ratio of th/t» is the same for holes and electrons, this formula 
becomes 

„ Tjy — nb 2 + p 

Rh = — , . , .f (39) 

t m (nb + pfce K ' 

where b = ix n /y. p . For thermal scattering, t#/t m = 3tt/8, and the formula 
reduces to that quoted in Chapter 8. 

11.4b. The General Case. For the case of a degenerate semiconductor, 
similar expressions can be obtained, but, as discussed previously, the 
effectiveness of electrons must be weighted by 1 — f = f p and holes by 
(1 — f p ) = f. The corresponding general expressions derived from equa- 
tion (10) and slight generalizations of (14a) and (14b) become for H-type: 

e(f p v 2 T) er { 

u = rie — " ' 

and 



= w -3ir" = w T' (40) 



T ff = <fy<r 2 >/<V 2 T>, (41) 

where again each conduction electron is given equal weight in taking the 
average. The expressions suggest that for the degenerate case t m is a 
weighted average of t over the electrons with each electron having a weight 
»%. For spherical energy surfaces this is found to be the case, for then 
3kT\s m(v 2 f p ) (see Problems) and then we may write 

t„ = <T0%>/<tr%>; t h /t, = <t 2 vH v XvH p >/<tvH p > 2 . (42) 

It should be noted that (40) is the general expression of which (42) is 
only a special case. Precisely similar expressions are obtained for holes by 
replacing f p by f and averaging over the holes. 

11.4c. Simple Metals and Degenerate Semiconductors. For the case 
of a highly degenerate semiconductor, or a metal, most of the states in the 
band of interest are either full with f = 1 and f (1 — f) = or else they are 
empty with f = and again with f (1 — f) = 0. The important con- 
tributions in the sum over the energy shells thus come from a relatively 
narrow energy region, about 2kT wide, centered at & = & F , in which 
f (1 — f) is approximately (^) 2 = 0.25. In this narrow range, the other 
functions of & may be regarded as constant. Accordingly, the integral 
for a, obtained from the integrated form of (8) by dividing by E x , is 

e 2 r 

- r Tv 2 ji{\-i)dS (43) 



3kTI 



where the approximation of spherical energy surfaces has been used in 
taking v 2 out of the integral. The integral can be converted directly to an 
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integration over energy by using equation (9) of Section 9.1: 

dS = (4w/h 3 )(2m) 3A (& ~ S ) H */S. (44) 

It is simpler, however, to use another procedure. If we let 2 &i represent 
S — &o, then the number of states S in the Brillouin zone with energy less 
than 6>i is proportional to P 3 or to &i yi . As a consequence of this, we have 
at the highest, occupied states, that is, for &i = G>f — d> > 

since the total number of occupied states is the electron density n times V. 
The integration of f(l — f)d& is then carried out, using equation (6), as 
follows : 

f±d& 1 i(l-f)=f-dt=l (46) 

when integrated across the Fermi level 6>f since f varies from unity to zero 
over this range. Thus a becomes 

e 2 9 2>nV er er ,,„ 

e = —/tv'—^- = ne— ; m = — > (47) 

ZV 2&X m m 

the second equality following from 6>i = mv 2 /2. This expression is just 
that obtained on the basis of a single mean free time in Chapter 8. The 
physical reason for this simple result is that all the electrons are accelerated 
by the field so that / = ne 2 E/m as discussed in Chapter 5. The decay of 
the current is all brought about by electrons in the outermost energy shell 
however, since all inner shells are full. Thus, the conductivity involves 
the density n of all the electrons, but the relaxation time t of only those of 
highest energy. The transverse component of current produced by the 
magnetic field also decays with the time constant t and consequently 
T = Th. Since t m and tjj are equal, the formula for the Hall constant is 
simply 

R H = -1 1 nee. (48) 

The Conwell-Weisskopf formula has been extended to the degenerate 
range by Johnson and Lark-Horovitz. 3 For this case only a single energy 
is involved and it is not necessary to average tj over the energy. The 
energy & required is the highest occupied state of the degenerate electron 
gas; it is evaluated as follows: 

2 4,r n3 n h(Zn\ A _ h 2 /3nV A 

n =i*-3 p > p -iU > & =^u ■ (49) 

2 Si in these manipulations is not to be confused with §i used for thermal scattering. 

3 V. A. Johnson and K. Lark-Horovitz, Phys. Rev. 71, 374-375 (1947). They also give a 
general integral for intermediate cases. 
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The formula for the relaxation time discussed in (17) of Section 11.3 is 
_ 4 t c _ 4 ai (25/gj) 2 

T/ t (1 - cose) 7T v 21n[l + (2S/S0 2 ] ( ' 

The quantity 26>/&i has the value 4 



2S ne mi h 2 /3n\ % K (3/t) m Kh 2 n H 

= 2© 



8m\w) 



3 9.™ V - / * 2 *,^ g OTf 2 



£>/ 2? 2 8m\w / e 2 n 



= 3.4 X KrV (for S,-) 

= 5.0 X 1(T 7 « K (for G e ), (51) 

with m as the electron mass and k = 13 and 19 respectively. 4 * In the 
foregoing equation we have set n = «r since in a "doped" degenerate 
sample the electrons and donors must be very nearly equal in number. 
From this equation we see that 2&/6>i = 1 for n = 3 X 10 19 . From a 
limiting form for the average of the mean free time, Johnson and Lark- 
Horovitz conclude that 26>/&i will be small. 5 On this basis they simplify 
their expression by what is equivalent to expanding the logarithm term, 
leading to a cancellation of the term (2S/&/) 2 in the numerator, a result 
equivalent to setting (1 — cos 8) equal to 2. The physical meaning is that, 
for & <§; &i, the electron will follow an almost parabolic orbit around the 
ion and shoot back out of the cell with its velocity completely reversed, 
that is, the collision will have 6 = 180° or cos 6 — — 1. For this case the 
formula for mobility becomes 



e 2cn 2erT l/i 4 [V| H 
and the resistivity becomes 



4 rv~\ H e 
M = w T/ = w"^ = ~^" = xL3j Im* (52) 



ney. 4L.irJ e 



The value obtained by Johnson and Lark-Horovitz is it times smaller due 
to differences in the approximations. In practical units their value becomes 

p = 6270«~ M ohm-cm (54) 

with n in cm -3 . 

11.5 COMPARISON WITH EXPERIMENT 

The most extensive presentation of a comparison of theory with experi- 
ment for semiconductors of the type covered by the theory of this chapter 

4 2&/&i is the same function of n as is y of equation (62) in G. L. Pearson and J. Bardeen, 
Phys. Rev. 75, 865-883 (1949). 

4a See Chapter 9 for later values of k. 

6 This conclusion does not seem to be valid unless n < 3 X 10 19 as discussed previously. 
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is found in the paper of Pearson and Bardeen, to which we have so often 
referred. 1 The specimens studied by them may be broadly divided into 
two classes: degenerate and non-degenerate, divided by the condition [see 
equation (16) of Section 10.3] 

T = m* *!_ n% = 42 x 10 _ n r m_~\ n% o K 

LirJ okm n \jn n \ 

kT = 21.6 X V)- 16 (m/m n )n M electron volts (lb) 

for »-type, and a similar equation for p-type. 

For the degenerate cases, the important electron velocity corresponds 
to the highest occupied electron level and does not change appreciably with 
temperature. For these cases, the impurity density is so high that im- 
purity scattering dominates lattice scattering. As a result, the resistivity 
and Hall coefficient are both practically constant over a wide temperature 
range. Samples 6, 7, 8 in Figures 11.7 and 11.8 (taken from Pearson and 
Bardeen) exhibit this behavior. These are p-type. samples containing 
added densities of boron to the amount indicated on the curves. The fact 
that the added boron content for sample 8 exceeds the number of carriers 
is interpreted, on the basis of other evidence, as due to the precipitation of a 
second boron-rich phase on the grain boundaries of this alloy, thus dimin- 
ishing the effective boron concentration. The samples become non- 
degenerate at higher temperatures, the line of demarcation being indicated 
as Tdeg on Figure 11.8 where m/m n has been set equal to unity for simplicity. 
(At still higher temperatures they would become intrinsic.) The mobilities 
for 6, 7, 8 are respectively 31, 30, 54 cm 2 /volt sec. This increase of mo- 
bility with increasing impurity content is in the opposite direction from the 
trend predicted by extending the Conwell-Weisskopf formula into this 
range even if the complete expression rather than the Johnson-Lark- 

1 Very extensive and early investigations of the properties of germanium along the lines 
discussed here have been reported from the group at Purdue working under the direction of 
Professor K. Lark-Horovitz. Partial reports of their findings are given in the following 
references: K. Lark-Horovitz, A. E. Middleton, E. P. Miller, and I. Walerstein, Phys. Rev. 69, 
2S8A (1946). 

Additional investigations on germanium are reported in the following references: J. Bardeen 
and W. H. Brattain, Phys. Rev. 75 (8), 1208-1225 (1949); Bell Syst. Tech. J. 28 (2), 239-277 
(1949). 

Additional references of general interest in this connection are: K. Lark-Horovitz and 
V. A. Johnson, Phys. Rev. 69, 2S8A (1946); V. A. Johnson and K. Lark-Horovitz, Phys. Rev. 
71, 374-375 (1947); 72, 531 (1947); K. Lark-Horovitz, A. E. Middleton, E. P. Miller, 
W. W. Scanlon, and I. Walerstein, Phys. Rev. 69, 259 (1946); V. A. Johnson and K. Lark- 
Horovitz, Phys. Rev. 69, 259 (1946); W. Ringer and H. Welker, Zeits.fur Naturforschung 3A, 
20-29 (1948); W. C. Dunlap, Jr., G. E. Review 52, 9, February 1949. Much of the foregoing 
material is reviewed by G. L. Pearson, K. Lark-Horovitz, and N. C. Jamison, Electrical 
Engineering 68, 1047-1056 (1949). 
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Fig. 11-7 — Resistivity of Silicon-boron Alloys versus Temperature. 
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Fig. 11-8 — Density of Holes in Silicon-boron Alloys versus Temperature. 
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Horovitz approximation is used. Pearson and Bardeen have considered in 
addition a formula derived by Mott for application to alloys. Although 
all these formulae give general agreement with experiment as to order of 
magnitude of the mobilities, none appears to agree in detail in giving the 
correct dependence upon »/. Sample 5, which is expected to be degenerate 
below — 150°C, is anomalous in its behavior and does not have a constant 
mobility at low temperatures. 

Johnson and Lark-Horovitz have also made a comparison between their 
formula p = 6270»~~^ and some germanium samples having resistivities in 
the range of 2.9 to 5.1 X 10~ 3 ohm-cm, obtaining good agreement. 

There are difficulties in connection with applying the Conwell-Weisskopf 
procedure to the degenerate samples. The wave length of the fastest 
electrons of the degenerate gas is given by the formula 

Hence the unit cube considered in the scattering model is only \ wave 
length wide and its "radius" is about jX. For this relationship between 
wave length and spacing, the approximation of using the Rutherford 
formulae becomes completely untenable on the classical basis and very 
dubious on the quantum-mechanical basis since the individual scattering 
centers are so close together. Thus in addition to problems of determining 
the effective potential field in which the electrons move, there will be prob- 
lems of coherence in the scattering. 

For the non-degenerate samples, the formulae of the last section should 
be applicable. However, in certain ranges of temperature both scattering 
processes may be important and it is necessary to consider the combined 
effects of impurity and thermal scattering. For electrons of a given 
energy, the two relaxation processes will be additive. Hence, a perturba- 
tion of the equilibrium will decay in time dt by a fraction 

dt 



+ — 

Tf TT 



— -I \dt = -dt, (3) 

T/ Tj'J T 



where 

t = tit t /(tt + Tr). (4) 

This quantity should then replace t in the averages used in computing t m 
and th- If t m and tl depended upon v in the same way, this would lead 
to the conclusion that 

T " = t -l-T (S) 

and a similar expression for th. The errors introduced by this process, due 
to the fact that tj and Tr depend differently on v, may amount to SO per 
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cent or more. We shall not analyze this question further at this point but 
will use the approximation generally employed in comparing theory and 
experiment and take the two relaxation times as additive. 
We thus write as an approximation, 

1--L + .L. (« 

TV "V T »T 

and hence 

!-! + !. (7) 

where m and ht are the mobilities corresponding to the effects of impurities 
and thermal agitation individually. For these the formulae are : 

J(kT) H 



8 hJ 

■kVtN 



ln L 1 + W0. 

giving for silicon with k = 13, 
li = S.SXlVfH/Ni In (1+* 2 ) cm 2 /volt sec (for JV> cm" 3 , T°K) (9a) 
x = 2.3Xl0 4 r/JV/ M (9b) 

and similarly for germanium with k = 19, 

M = 1.2 • KW/iVr In (1 + x 2 ) (10a) 

x = 3.4 • 10* T/iVi H . (10b) 

The formula for ht is 

Mr = Z e- 2 ■H.ji = COnSt T C 11 ) 

and the experimentally determined values are 

M „ r = 15 • 10 5 r~ % cm 2 /volt sec for silicon; (12a) 

H pT = 5 • 10 5 r~ % cm 2 /volt sec for silicon; (12b) 

M„r = 19 • 10 6 r~ w cm 2 /volt sec for germanium; (12c) 

Aipr = 8.9 ■ 10 8 r"^ cm 2 /volt sec for germanium. (12d) 

The temperature dependencies of m and mt are quite different, their 
variations being approximately as T* 4 and as T~^ respectively. In 
Figure 11.9 these mobilities are plotted for holes and electrons in silicon 
and germanium. 2 

2 Values of k and of drift velocities in germanium obtained since this figure was prepared 
differ slightly. See Chapters 9 and 12. 
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Pearson and Bardeen find that they can fit the mobility data for non- 
degenerate samples, Figure 11.10, reasonably well with a formula of the 
form 

1 = 4* + >T* d3) 

fJi J. 

which corresponds to replacing the m curves of Figure 11.9 by straight 
lines. The values of a which they obtain are within about a factor of 2 




5 10 20 50 100 200 500 1000 

TEMPERATURE, T, IN DEGREES KELVIN 

Fig. 11-9 — Impurity and Thermal Scattering. 

of those predicted from the equation for m and the impurity content, the 
impurity content being estimated from the saturation behavior. The 
same degree of agreement is obtained for the data quoted in Chapter 1. 

Curve 1 deserves special comment; it represents a highly purified 
sample of silicon obtained from E. I. du Pont de Nemours & Company. 
The anomalous behavior of very high resistivity samples has often been 
observed in silicon and germanium and is attributed to inhomogeneity. 
For such samples, small density differences from place to place will make 
large proportional differences in conductivity. Under these conditions, 
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the current paths become tortuous, and for such paths an artificially low 
Hall effect and mobility will be obtained. 

/ = 3V2wm n kT n/4e = 4.12 X W^{m n Tlm)^ i i L cm (14) 

The value of the mean free path in the thermal agitation range is from 
(28a) of Section 11.4. For T = 300°K and m n = m,l = 7.1 + lO^Ww. 

TEMPERATURE IN DEGREES C 
100 O -100 -150 




TEMPERATURE IN DEGREES K 

Fig. 11-10 — Hole Mobility in Silicon-boron Alloys versus Temperature. 

Values of the mean free path based on this formula with m n /m — 1 and 
rrip/m = 1 are shown in Figures 11.11 and 11.12. The values for low 
temperatures, where lattice scattering predominates, correspond to in- 
serting (36b) of Section 11.4 into (14) thus obtaining 



4k 2 (kT) 2 



aj\(m 



I = 



1 + 



/3KkT\n , r /6kT\ 2 ~\ 



(15) 
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Fig. 11-11 — Mean Free Path in Silicon-boron Alloys versus Temperature. 
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Fig. 11-12 — Mean Free Path in Silicon-phosphorous Alloys versus Temperature. 
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where, in the notation of Section 8.3a, 

ai = Nr % and 6/ = 2**/**/. (16) 

Formula (15), as well as the curves of Figure 11.10, may be used to esti- 
mate ion densities from mobility data. 

Problems 

1. Derive the equation of the hyperbolic orbits. This may be done by 
setting the radial acceleration equal to the centrifugal force minus the 
attractive force 

mf = mrd 2 — Ze 2 /i<r 2 , 

and eliminating by the principle of conservation of angular momentum 
in a central field: 

mrH = J. 

The resulting equation for f may be converted to an equation for d u/dd , 
where u = 1/r, by two steps : 

dr dr . J 1 dr J du 

dt dd m r 2 dd m dd 

followed by a similar procedure for (d/dt)(dr/dt). The resulting equation 
may then be integrated to find u as a function of 8, the result being: 

1 _ TJyi 
r ~ nf 



u =- = — ~. r + A cos (6 + 6 ) 



where A and 6 are integration constants. 

For the derivation of the r equation and other steps, see standard intro- 
ductory texts to theoretical physics, such as L. Page, Introduction to 
Theoretical Physics, D. Van Nostrand Co., New York, 1935. 

2. Apply the expression for current density of Chapter 15 to the trans- 
mitted wave of equation (18) and verify that the transmitted current is 

4P„P 6 /(P„ + P b ) 2 ~ |4Z 1 Z 2 /(Z 1 + Z 2 ) 2 |, 

the latter form applying to electrical transmission lines. Verify that the 
sum of reflected and transmitted currents equals the incident current. 

3. Estimate the change in energy of an electron when it interacts with 
a phonon by considering the Doppler effect produced by the motion of the 
deformation pattern. Consider first the case of motion along the #-axis. 
Let the incident electron wave function be exp i(P\x — &it)/h and the 
reflected wave be exp i( — P 2 x — & 2 t)h. Let the pattern move with speed 
c so that its position is given by x — ct = 0. Show that the requirement 
of constant phase relation between incident and reflected waves at the 
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location x — ct leads to P 2 — Pi = Imc if & = P 2 /2m and hence for 
Vi = Px/w > c that AS — ww. 

Show further that, if the moving pattern has a periodic form such as 
sin 2w(x — ct)/\ c , the requirement of reinforcement of the waves reflected 
from successive planes of equal deformation is (P 2 + Pi)V^ = 2tt and 
hence that Si — G>2 = 2irhc/\ c = hv, independently of the form of the 
functional dependence of S on P. 

A more difficult problem is to generalize these results to three dimen- 
sions and compare them with the procedure of Section 17.6, Figure 17.6, 
and associated text. 

4. Show that in Section 11.4 equation (40) is equivalent to (43) when 
applied to a metal. 

5. Establish the formulae required for calculating (v n ) for integral 
values of n by letting 1/vt 2 = a and writing 

and evaluating dl n /da by differentiating under the integral sign. This 
leads to equation (21) of 11.4. Evaluate 7_i directly, and I_2 by con- 
sidering 

(7_ 2 ) 2 =f " frax ^ x f" <r a v'dy=f* ddf" (T ari rdr = x/4a. 

6. Prove that, if the spherical approximation holds, so that dS — av 2 dv 
where a is a constant, then 

<»%) = 2>kTm 

when averaged over the electrons in the distribution. This may be proved 
from the definition of (vH p ) which is 



(v%) =J" v%iav 2 dv/ j" iav 2 dv 



by integrating the numerator by parts with the aid of equation (6) of 
Section 11.4. 

7. Obtain the formal expression for the conductivity tensor for a non- 
cubic crystal in which t is a function of & only. The rate of change of 
current in d in shell dtb due to E is then 

2e 2 ,_ ,tid& 



E*] 



. rtr'C-M-W- 



where v = \v\. Hence show that 

__ if r dS2d& 

" ~ h-'kl 



: : r fvtni(l - f) 
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where vv is a tensor operator such that 

(vv)E = v(v • E). 

8. There is an apparent paradox associated with the occurrence of 
(v 2 t) in the expression for t^. From the viewpoint of an individual 
electron, the reasoning of Chapter 8 suggests that jlx = er/m. Hence, if 
different electrons have different values for t, we should expect to have 
H = e(r)/m. The error in the latter expression can be attributed to 
the neglect of the variation of mean free time of the electron due to the 
change in its velocity while it is being accelerated. The problem is to 
show that when this effect is taken into account the correct formula for 
conductivity may be obtained by considering the trajectories of the individ- 
ual electrons as was done in Section 8.5. 

Suppose that after a collision an electron is equally likely to end in any 
state in an energy shell d& but that t(P) may vary from point to point in 
the energy shell. Show from the principle of detailed balancing and the 
symmetry condition S(P) — d>(— P) that the rate at which electrons are 
scattered into a volume dV Pi is unaffected to the first order by applied 
electric fields and has a value 

i gi = (2/A 3 )f[6(P < dWr,,-/T(/\). 

If the average displacement of each of these electrons before its next 
collision is r,;, show that the average value of £1/ for all the electrons is 

Owing to the effect of the field, which is taken as i x E x for simplicity, each 
electron is accelerated by a force F x . As a result, its mean free time and 
velocity vary as 

dv 
dP x 

Show that the probability that the electron has not collided after time / is 

and hence that its average displacement is 

, / dT dv \ „ 
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(Note that the Pauli exclusion principle does not invalidate the inte- 
grated forms of these expressions.) Integrate this expression over P-space 
by writing dV P = dP x dPydP z and integrating dP x by parts so as to obtain 

Zv = ^fvv x rf(l-f)dF p 

Show that this is equivalent to the result of Problem 7. 

9. For spherical energy surfaces and t a function of v only show from 
the equation for r, in Problem 8 that the "mobility" of the electrons in an 
energy shell is 

[er(v)/m][\ - W^T/d\nv)]. 

Average this over a Maxwellian velocity distribution taking t(») = l/v 
and obtain (28b) of Section 11.4. 



CHAPTER 12 
APPLICATIONS TO TRANSISTOR ELECTRONICS 

12.1 INTRODUCTION 

In the earlier chapters of Part II the theory of the behavior of holes and 
electrons is presented largely in pictorial and descriptive terms. In 
Chapter 11, however, the statistical treatment of the influence of electric 
and magnetic fields is given in detail. The quantum-mechanical founda- 
tion for all these subjects is presented in Part III in terms of the wave 
functions for electrons in crystals. 

There remain, however, certain steps to be taken before the attributes of 
holes and electrons discussed in the earlier chapters of Part II can be 
embodied in an analytic theory for use in treating transistors and other 
semiconductor devices of the sort discussed in Part I. These additional 
steps are taken in this chapter, and some detailed examples of application 
are presented. 

This chapter serves the additional purpose, as has been indicated at 
various points in the text, of giving a critique of the data presented earlier. 
In particular the closing sections are concerned with values of 5>ig and the 
mobilities of holes and electrons. Much of the best data on these impor- 
tant topics was being accumulated as this book was being written and the 
values chosen for examples were later shown in many instances to be inac- 
curate. Since these values were used to illustrate principles rather than to 
compute definitive results, the task of revising them to keep them up to 
date while the book was in preparation has not been undertaken. Instead 
the best values available at the latest date at which proof could be revised 
have been incorporated in the last sections of this chapter. 

The basic equation which is required for treating many problems involv- 
ing the flow of particles is the equation of continuity. This equation states 
the principle of conservation of matter in the following form: The rate of 
increase of the particles in an element of volume is equal to the net inward 
flow across the surfaces of the element. In the case of holes and electrons 
this must be generalized, as discussed in Section 12.2, to allow for recom- 
bination and generation of hole electron pairs in the element of volume. 

In order to deal with most semiconducting devices it is necessary to 
consider the random diffusive motions of the charge carriers provoked by 
thermal agitation as well as the systematic drifts produced by electric fields 
and magnetic fields. (See Figure 8.5 for an illustration of both types of 
motion.) The diffusive motion is described by the diffusion constant 

295 
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which is related to the mobility by the Einstein relation D n = kTuJe for 
electrons of D p = kTn P /e for holes. This relationship can be derived 
directly by the statistical methods of Chapter 1 1 or from a consideration of 
the principle of detailed balancing. We shall give the latter presentation 
in Section 12.3. 

In Section 12.4 we shall introduce the quasi Fermi levels, concepts which 
are of considerable aid in analyzing processes in semiconductors when hole 
and electron densities are not in equilibrium. 

In Section 12.5 we shall apply diffusion theory to obtain the charac- 
teristics of a simplified model of a p-n junction including the rectification 
characteristic and the dependence of admittance on frequency. 

In Section 12.6 we shall deal with diffusion and recombination in a 
germanium filament. In many cases the recombination takes place chiefly 
on the surface of the filament and is limited by diffusion to the surface. 

In Section 12.7 a treatment due to C. Herring of the motion of a pulse of 
holes (or electrons) in a body of opposite conductivity type will be pre- 
sented indicating how deformation of the wave shape may arise. 

In Section 12.8 various ways of determining & 10 , the rate of change of 
energy gap with volume, are reviewed and compared with values obtained 
from mobility measurements. 

In Section 12.9, data relating to mobility and the effect of magnetic 
fields upon conduction are reviewed and the best values obtainable at the 
time of going to press for drift and Hall mobilities in germanium are 
quoted. 

In Section 12.10, some experimental results and speculations on the origin 
of noise are presented. 

The concluding section expresses in general terms an appraisal of semi- 
conductor theory and experiment and analyzes future prospects. This 
section belongs in some ways at the end of the book but since it most 
closely relates to the material of Parts I and II it is placed at the end of 
Part II. 

12.2 NOTATION, CONTINUITY EQUATION, USE OF q FOR e 

In the remainder of this chapter we shall use the following notation. 
Some of the terms are defined in Section 12.3. 

^ = electrostatic potential (1) 

E = - V^ = electric field (2) 

<p = Fermi level (3 a) 

<p n = quasi Fermi level for electrons (3b) 

<p p = quasi Fermi level for holes (3c) 

+q = charge of one hole = e (4) 

e = base of Napierian logarithms. (5i 
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The notation q for the electronic charge has been found convenient by the 
author when dealing with problems in which both applied potentials and 
temperature appear explicitly, as they do frequently in transistor elec- 
tronics. The use of q replaces Boltzmann factors like 
e -.*/*r = exp (_^/*r) by e -^ /fcT . 

The alternative use of e for the base of Napierian logarithms is undesirable 
since € and e can occur in the dielectric constant. The choice of ip — psi 
for potential and <p = phi for Fermi level is made so that the phonetic 
relationship may act as an aid to memory. 

We shall also assume that any deviations of hole or electron densities 
from their thermal equilibrium values tend to decay with time constants, 
or lifetimes, denoted by 

t p = lifetime for holes in «-type material j 1 (6a) 

t„ = lifetime for electrons in ^>-type material. (6b) 

In connection with p-n junctions we shall introduce for the thermal equi- 
librium densities the following notation: 

n„ and p n , electrons and holes in w-type; (7a) 

n p and p p , electrons and holes in p-type. (7b) 

The basic equation which is required to treat the behavior of holes and 
electrons under conditions in which the concentrations are functions of 
time and space is the continuity equation. This may be written, for the hole 
density p, in the form 

^=Gr-r)-V/,. (8) 

dt q 

The derivation is as follows: Consider a small region dx, dy, dz centered at 
x, y, z. The rate of change of holes in dxdydz is 

^ dxdydz. (9) 

These holes arise from the excess of generation over recombination in the 
volume and from a net flow across the surfaces. The first process is 
represented by 

(g — r)dxdydz (10) 

where g is the net rate of generation of holes per unit volume (due to light, 
X rays, thermal excitation, etc.), and r the net rate of decay by recombina- 
tion with electrons. Since p represents untrapped holes, r may include 

1 Tau appears in the form t in Chapters 8 and 11 to represent the relaxation time, the 
dependence upon v being indicated by the fact that it is bold face. The symbols for lifetime 
used in this chapter are of lighter weight so that no confusion should result. 
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holes which become trapped without recombining and g may include their 
re-emission. The term containing V • I p represents hole flow across the 
boundaries and is derived as follows: The current density in the ^-direction 
at the midpoint of the dydz face located at x — {dx/2) is 

l^y,z)- d -^f^ d i- (11) 

dx 2 

This midpoint value will be the average value for the face so that the hole 
flow across the face into dxdydz will be 



If*,. dip* dx~\ 



q L 



The flow out of dxdydz across the face at x + {dx/2) differs only by replac- 
ing the ( — ) by (+). The net flow across the dydz pair of faces into 
dxdydz is thus 

i r ei px dxi i r di px dxi 
«|> - 17 1 J dydz -- q V»* + -Jx~ TJ dydz 

= —^dxdydz. (12) 

q dx 

Proceeding similarly for the dxdy and dxdz faces leads to a net flow into 
dxdydz of 

b lT + *-¥■ + d ~¥\ ixd y dz ^ - - V ■ hdxdydz (13) 

dx dy dz J 9 

where the quantity in square brackets is by definition V • I P , the divergence 
of I p . Equating the net rate of change in dxdydz to the net generation plus 
the net flow in across the surfaces leads to the continuity equation (8) times 
a common factor dxdydz, which may be divided out, thus establishing (8). 
In general it will be assumed that deviations for injected carrier densities 
p or n from the equilibrium values p n and n p tend to decay with char- 
acteristic lifetimes which are independent of the concentration of the 
carriers or the way in which they are introduced. For this purpose we 
consider that due to recombination and spontaneous generation the net rate 
of change of injected holes in «-type is 

{pn ~ P)/t p (14a) 

and for electrons in p-type the corresponding contribution is 

{n v - »)/t„. (14b) 

The continuity equation for holes is then 

dp pn- P 1 , , n , . 

— = V • I P + g P (15a) 

ot T P q 
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where g v ' is the net rate of generation of holes in the volume due to photons 
or other external exciting agency. The similar equation for electrons 
in ^>-type material is 



dn _ n v — n 1 
Bt t„ q 



+ ~ V • /„ + gn'. (15b) 



We shall use these equations for the case in which g p = g n — to treat 
the flow of holes and electrons in p-n junctions and filaments. (Sections 
12.5 and 12.6.) 

12.3 THE EINSTEIN RELATIONSHIP BETWEEN MOBILITY 
AND DIFFUSION CONSTANT 

We shall consider the diffusion process only for the case in which the 
density of carriers is so small that the effect of the Pauli exclusion principle 
can be neglected. Under these conditions we can suppose that each carrier 
drifts in the electric field and undergoes random diffusion independently of 
all the others. We shall also suppose that the electric field is small enough 
so that it does not appreciably disturb the equilibrium distribution of 
velocities of the carriers, this being the assumption which permitted the 
neglect of terms higher than the first power in E in Chapter 11. Under 
these conditions it is justifiable to consider that the hole current (we shall 
treat holes for our example) arises from superposing a conduction and a 
diffusion current density. The latter is directly proportional 1 to the 
gradient of hole density Vp. Thus we write 

I P = quppE — qDpVp. (la) 

In = qHnnE + qDnVn. (lb) 

The minus sign occurs in (la) because, in the absence of an electric field, 
the net diffusion of holes will be away from the direction in which p in- 
creases. Since the charge on the electron is minus, the corresponding 
sign in (lb) is plus. 

If the electrostatic field is produced under conditions of thermal equi- 
librium, as for example in the case of the transition region of a p-n junction 
as discussed in Section 4.2 or a metal semiconductor contact as in Sec- 
tion 4.3, then the statistical mechanical principle of detailed balancing 2 
requires that for every hole which is crossing a given area in one direction 
per unit time there is another which is crossing it in the reverse direction 
with the same speed. This last statement is true even if magnetic fields 

1 This can be shown by a statistical treatment of the individual holes by methods like 
those of Chapter 11. 

2 R. C. Tolman, The Principles of Statistical Mechanics, Oxford at the Clarendon Press, 
1938, p. 165. 
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are present, 3 a fact which we shall soon use. Thus under equilibrium con- 
ditions I p must vanish. Under equilibrium conditions, however, the hole 
density is governed by the Boltzmann factor, exp ( — qip/kT), which repre- 
sents the tendency of holes to avoid regions of high potential energy. 3 " The 
formula for p is given in equation (14) of Section 10.3, 

p = A€-^ lhT (2) 

where A is a constant involving N v and &f, which we shall discuss more 
fully in connection with the quasi Fermi levels. Inserting equation (2) 
into equation (1), evaluating Vp and requiring that l p = 0, we find that 

= qt x p pE -qD p (- j^ V*) Ae-*+i* T 

= q(ih>-jfD^pE (3) 

which can be satisfied only if the 

Einstein relationship n P = — — D p (4) 

kl 

holds. 4 A precisely similar calculation establishes the Einstein relationship 

Hn = T-D n (5) 

kl 

for electrons. (In treating electrons, it should be remembered that y, n is 
defined as positive and so is D n .) 

The dimensions of D p are (length) 2 /time. In the c.g.s. system D p is 
expressed in cm 2 /sec. If h p l is expressed in cm 2 /volt sec (the L system 
of Section 8.8), then equation (4) leads to 

li P L cm 2 /volt sec = — D p cm 2 /sec, (6) 

kl 

which is consistent because q/kT is charge/energy which has the dimen- 
sions of volts -1 ; at 300°K, kT/q = 0.026 volt so that for the L system and 
the M.K.S. system we have 

D p = D pL = 0.026m p i, (7a) 

D p m — 0.026/ipjif- (7b) 

3 See J. H. VanVleck, The Theory of Electric and Magnetic Susceptibilities, Oxford at the 
Clarendon Press, 1932, pp. 94-98. 

3a The relationship of qtp to &„ is discussed more fully in the next section. 

4 A. Einstein, Annalen der Physik, 17, 549-560 (1905); Muller-Pouillet, Lehrbuch der 
Pjysik, Braunschweig, 1933, IV (3), pp. 316-319. 
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The quantity D p t p , which occurs in the analysis of later sections, has the 
dimensions of (length) 2 and is denoted by the symbol L p where L p is called 
the diffusion length. The diffusion length, as discussed in Sections (12.5) 
and (12.6) and the Problems, is the average distance along any given 
direction, ^-direction for example, which a hole will diffuse before recom- 
bining, if its lifetime is t p . 

Under non-equilibrium conditions the current may be written as the 
sum of a conduction term and a diffusion term as given by equation (1), 
but the two terms in general will not cancel. 

When a magnetic field is present, the conduction current does not usually 
flow parallel to the electric field and the diffusion current does not flow 
parallel to Vp. The conductance and diffusion constants both become 
tensors which we may derive from the resistivity tensor p. We shall treat 
the simplest case: For a cubic crystal with H along the z-axis, [001] direc- 
tion, we may write the electric field in the form 

E x = p t {I x - I y tan 6) (8a) 

E v = P ,(I X tan 6 + I y ) (8b) 

E z = pih (8c) 

where p t is the resistivity transverse to H [defined by setting power dissi- 
pation density equal to p t (I x 2 + I v 2 )], and pi (/ for longitudinal) is the 
resistivity parallel to H, and 6 is the Hall angle, the angle between E and / 
for transverse currents. This equation is analogous to the Hall effect 
formulae of Chapters 8 and 11 except that we have written it in a form 
adaptable to large values of H and 6. In this form 6 is plus for holes, and 
negative for electrons. Equations (8a) and (8b) may be written com- 
pactly as a complex equation: 

E x + iE v = (p,e*/cos 6) (I x + il y ). (8ab) 

In order to define the diffusion constant we shall solve equation (8) for 
the currents and express these as I x — qpv x , etc., where the v's are the drift 
velocities. We introduce 

p. t = l/qppt, p-i = l/qppi (9) 

for this purpose. We then obtain 

v x + iv y = e~ if V« cos B(E X + iE v ) (lOab) 

or 

v x = ii t cos 2 B{E X + E v tan 6) s= p. xx E x + n X yE v (10a) 

v y = ii t cos 2 6( — E x tan 8 + E y ) = n yx E x + ii vv Ey (10b) 

v z = mE z = ii zz E z . (10c) 

The diffusion current is then found from the principle of detailed balanc- 
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ing and the Boltzmann distribution (2), neither of which is affected by a 
magnetic field. The resulting relationships are 

I '- = - 9 ( D " d £ +D " a i) (llb) 

I, = -qD„-£ (lie) 

where each D is (kT/q) times the corresponding y. of equations (10). A 
similar relationship holds for electrons. 

12.4 ENERGY BAND DIAGRAMS, ELECTROSTATIC 
POTENTIAL, AND QUASI FERMI LEVELS 

In Figure 12.1 we represent the energy leve! scheme for a p-n junction. 
We have discussed similar diagrams on a largely intuitive basis in Chapter 4 
and briefly in connection with Figures 5.13 and 7.6. We shall now lay 
an analytical foundation for treating them. The diagram implies that an 
electron in the lowest state of the conduction band, or better a localized 
wave-packet made up of wave functions near the lowest state, will have 
different energies in different parts of the crystal. This situation repre- 
sents a macroscopic electric field since the energy of interaction of the 
electron with the local atomic field is the same no matter where the electron 
is in the crystal, provided that its wave function distributes it in the same 
way with respect to its immediate environment in all cases. Thus if the 
electrostatic potential at one point is ^a and at another is \pB, then, since 
the charge on the electron is —q, the energy 6> c of the lowest state in the 
conduction band will have a value of 

&cB = &cA + q(ipA - fa) (1) 

or, if B is varied while A is held fixed, 

&cB = — q4*B + constant. (2) 

A similar relationship will hold for a wave-packet at the top of the.valence- 
bond band, and the energy of an electron in such a wave-packet will also 
vary as — qfy. From this we can conclude that the energy of the missing 
wave-packet, corresponding to a hole, varies as q\f>. We shall illustrate 
this more fully as follows: 

The potential energy of a hole may be defined in terms of the electro- 
static work done in moving the hole. We may evaluate this energy by the 
following process: Suppose we have a hole at B. We can get the hole to 
A by exciting an electron at A from the valence-bond band to the conduc- 
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tion band, which costs us 6>g; we can then move the electron to B putting 
in work —q(^B — ^a) and then let it fall into the hole at B regaining S G . 
Thus moving the hole from B to A requires work to the amount 
qi&A — $b), which is just that required to move a charge +q from B to A. 
Accordingly, we may take the energy of a hole to vary as +q<p. This 
result is entirely consistent with the conclusions reached in Chapters 
7, 8, and 11 that a hole could be considered to behave as a positive particle 
with positive mass and to be accelerated by a force qE. These same 
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Fig. 12-1 — The Relationship between Energy Levels and Electrostatic Potential. 

conclusions are reached on the basis of more detailed arguments in Section 
15.8, especially as shown in equations (10), (11), and (12). 

It is convenient for our purposes to define the electrostatic potential in 
terms of a level of energy (not called an energy level because there need be 
no quantum states at that energy) approximately midway between the 
bands. We shall give the mathematical definitions in this paragraph and 
shall enlarge below upon the physical interpretation. We shall denote the 
level as €»• (/ for intrinsic for reasons which will soon become evident) 
and define it as 

&i = U&c + S v + kT In (N v /N c )], (3) 



the symbols having the meanings of Chapter 10. 
tial will be defined as 

$ = ~&i/q 
The energies & c and S v are then 



The electrostatic poten- 
or &{ = — qty. (4) 



So = -# + \kT\n (N c /N v ) + ±S G 
S, = -# + %kT\n (N e /N v ) - %S G 



(5) 
(6) 
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where 

&G = S c — S„. (7) 

We shall also define a Fermi level in terms of an electrostatic potential by 
the equation 

<p = —&f/i or &>p = —q<p. (8) 

We shall refer to <p, as well as to &f , as the Fermi level. The reason for 
these choices is that, when <p is equal to yj/, the densities of holes and 
electrons are equal as may be seen by substituting into equations (11) and 
(14) of Section 10.3, which give in terms of ^ and <p 

n = N c e-^- &F)/kT = n^*-* )/hT (9) 

p = N v e -<*»-m*t = Wie «(»-«/tr (1Q) 
where 

»,• = VN V N C e ~ S(1/2kT = Vpn, (11) 

in accordance with the results of equation (19) of Section 10.3. The 
quantity »; represents the density of holes and of electrons in an intrinsic 
sample for which the condition of zero space charge requires the equality 
p = n. 

We shall next describe in words the content of the equations presented 
above. For this purpose we consider a semiconductor containing a p-n 
junction so that the electrostatic potential and the values of p and n vary 
with position. There will evidently be a point in the transition region at 
which p and n are equal and thus each is equal to »,-, the value in an intrinsic 
sample. At this point the Fermi level will be approximately midway be- 
tween the bands. However, if the Fermi level were exactly midway then 
f = exp [(&f — & c )/kT] andfp = exp [(£>„ — & F )/kT\ would be equal and 
the ratio of p to n would be N c /N v . For the case of N c greater than N v , the 
midpoint of the energy band must lie above &f so as to reduce n and 
increase p until n = p. An upward shift of the band by AS decreases n by 
exp ( — A&/kT) and increases p by the reciprocal ratio exp (A&/kT) so 
that n/p is reduced by exp ( — 2A&/kT). Equating this last factor to 
N v /N c , the required reduction in n/p, leads to 

AS = (kT/2) In N c /N v . 

Consequently, at the point where n — p = »,-, the level of energy which 
lies AS below the midpoint of the energy gap will be equal to 6>f. This 
level of energy is S», defined in (3), since it lies at 

i(5 e + S,) - AS = i(S c + S„) - |* Tin N c /N v 
= %(& c + &„ + kT In N v /N c ). 
At points in the p-n junction where S,- rises above S/?, n is decreased from 
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Hi by a factor exp (Sy — &i)/kT and p is increased from n t by the recip- 
rocal factor. If we define an electrostatic potential by — q^ = &i 
and a Fermi potential by —q<p = &f, the preceding reasoning leads to 
equations (9) and (10) for n and p. Equations (9) and (10) can be 
remembered by the following scheme: When <p = \p, the densities cor- 
respond to the intrinsic case. Deviations of yp from <p introduce factors 
exp ±q{$ — <p)/kT. The plus sign corresponds to electrons since in- 
creases in ip correspond to more positive potentials and thus to lower po- 
tential energies for the negative electrons and hence to higher densities; 
the minus sign corresponds to holes, which are repelled by positive 
potentials. 

Differences in the Fermi potential <p are the quantities measured by a 
voltmeter. In order to illustrate this conclusion, we imagine that two 
conductors A and B are connected by a battery but that the circuit is open 
or of such high resistance that negligible currents flow. Under these 
conditions each conductor is substantially in thermal equilibrium and has a 
Fermi-Dirac distribution of electrons in its energy level scheme. The 
system as a whole is not in thermal equilibrium, however, and if metals A 
and B were brought in contact, currents would flow and irreversible heating 
effects would take place. The source of energy is, of course, the battery 
which is itself not in equilibrium. If metals A and B are identical and 
have identical work functions, then in the space between the two there will 
evidently be an electric field which produces a voltage difference between 
points just outside each metal equal to the battery voltage. Since the two 
metals have identical properties, each will have the same difference between 
the Fermi level and a wave-packet state representing an electron at rest 
just outside the metal. 1 Hence the difference in Fermi levels will be the 
battery potential. This result will also hold if the conductors are different, 
being A' and B', as may be seen as follows: Suppose A' is placed in con- 
tact with A and B' with B. Then since A' and A are in local equilibrium, 
they will have the same Fermi level and so will B and B' . Hence the dif- 
ference in Fermi potentials between A' and B' will also be equal to the 
battery potential. Another example of the theorem that the change in 
Fermi potential is equivalent to a change in applied voltage is given at the 
end of this section. A statistical mechanical treatment relating the Fermi 
level to a chemical potential for electrons is given at the end of Section 16.1. 

We shall next consider the application of equations (9) and (10) to the 
problem of the determination of the thermal equilibrium distribution of 
potential in a semiconductor of non-uniform impurity .content, the densities 
of donors and acceptors being functions Nd(r) and N a (r) of position r. 
We shall also assume that the donors and acceptors are fully ionized, as is 

1 This difference is the "true work function" of the surface. For a detailed discussion see 
C. Herring and M. H. Nichols, Rev. Mod. Phys., 21, 185-270 (1949), Section 1.2. 
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the case for germanium at room temperature when the Fermi level lies 
more than 0.1 volt from the edges of the energy gap. Under these con- 
ditions the charge density p is 

p = q(j> - n + N d - N a ) 

= q[N d (r) - N a (r) + 2»* sinh q(<p - 4>)/kT\- (12) 

Here <p may be regarded as specified, and ^ as an unknown function of 
position r. [The reader may find it instructive to compare equation (12) 
with Figures 16.2 to 16.6 in which the difference between heavy solid and 
heavy dotted lines is essentially equal to p plotted as a function of 
q(\p — <p).] The dependence of ^ upon r is determined from Poisson's 
equation, in e.s.u. and M.K.S. respectively, 

VV = -4xp/ K or VV = -pm/mo (13) 

which thus becomes an equation with one unknown ^ when (12) is inserted 
for p, it being assumed that N d (r) and N a (r) and the boundary conditions 
on \p are specified. 

It is not difficult to show that this equation always has solutions. The 
one-dimensional case is treated in detail in the literature. 2 We shall not 
give an analytical treatment here but shall discuss the implications of 
equations (12) and (13) in terms of physical arguments. Suppose we have 
a solution for the case of uniform values of N d and N a . (Such solutions are 
obtained in Chapters 10 and 16.) Then suppose we increase N d in some 
small region R, holding n and p arbitrarily constant with values » and p . 
This change will increase the electrostatic potential in R from \p to ^i 
where \pi is more positive since the change in N d represents an added posi- 
tive charge. We next imagine that ^ is held fixed with value fa and that 
the hole and electron densities are allowed to readjust themselves with a 
resultant flow of holes out and electrons into R. These flows will make a 
negative contribution to the charge density inside R, consistent with the 
sign of the term containing \p in equation (12), which becomes more nega- 
tive as ip becomes more positive. The change in charge density due to 
carrier flow thus tends to counteract the original change produced by 
increasing N d . This tendency to counteract is really the essential point. 
If we change N d , the change in n and^> is such as to tend to compensate the 
changed charge density. As a result the flow of holes and electrons tends 
to stabilize the potential and to neutralize the effect of changes in impurity 
density. The foregoing remarks are supposed to furnish food for thought 
and to improve the reader's insight into the behavior of the charge densities. 
A rigorous argument can be built up from this beginning to show that a 

2 W. Shockley, Bell Syst. Tech. J. 28, 435-489 (1949), Appendix VII; J. C. Slater, Phys. 
Rev., 76, 1592-1601 (1949). References are given in both papers. 
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solution for ^, and consequently n and p, can be found for arbitrary values 
o(N d (r) andAT (r). 

We shall next apply the arguments just given to the p-n junction shown 
in Figure 12.2. To the left of the junction in thep-region, we shall suppose 
that N a — Na is positive and constant with a value p v — n v . In this 
region, the electron density n p is very small compared to the hole density 
so that the space charge condition leads with a negligible error to 

N a - N d = p p 
or, letting ypp represent the value of \p in the p-region, 

f )lkT 



p p = me-^- 

so that by dividing by tii and taking the logarithm, we obtain 

kT, 



(14) 
(15) 



^ P = <P 



■In (pp/n,). 



(16) 
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I TRANSITION | 
> < REGION >< n-REGION 



<A. 



r -f 




DISTANCE X- 



Fig. 12-2 — The Dependence of n and p Upon Potential vff and Fermi Level <p for a 
p-n Junction with Higher Conductivity in the ^-Region. 



In the »-region, we shall suppose that N d — N a is positive and uniform 
with a value denoted by n n . The value of \f/ in the w-region is thus 



kT 

^n = <P -\ In {n n ln % ) 



(17) 



Thus \p n is more positive than ^ p for the equilibrium condition of constant <p. 
This situation is similar to the one discussed in Chapter 4. We shall next 
express in analytic form the conclusion presented in Chapter 4 that the 
tendency for holes to diffuse out of the p-region, where their density is 
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greatest, is canceled by the tendency for them to flow back owing to the 
electric field. We shall introduce the quasi Fermi levels for this purpose. 
The combined action of diffusion and conduction is conveniently treated 
with the aid of the quasi Fermi levels <p n for electrons and <p p for holes. 
These are defined in terms of n, p, and ^ for non-equilibrium conditions by 
the equations 

kT 

» = » i e« ( *-*» )/ * 7 ', Vn = i In (»/»,■) (18) 

9 

kT 
p = n^-^-^ lkT , <p p = * + — In {p/m). (19) 

? 

It is evident that for thermal equilibrium, which is the only condition for 
which a Fermi level <p is really defined, both <p p and <p n reduce to <p. 

In terms of <p„ and <p p , the total current densities may be rewritten using 
the Einstein relationship to eliminate D p in the simple forms 

kT — q(ip — <Pp) 

h = qVppE - qD p V p = -qn p pVt - q — n p pV — 

q kl 

= —qn p pVtpp (20) 

and similarly 

I n = —qnnnVpn- (21) 

These equations are formally the same as those for currents produced by 
electric fields — V<p p and — V<p n in materials of conductivity gn P p and q/i n n. 

The cancellation of the competing tendencies discussed in connection 
with Figure 12.2 is thus simply interpreted as the condition that <p n and <p f 
are both constant, being equal to <p, so that diffusion and conduction cur- 
rents are equal and opposite. In the next section we shall discuss p-n 
junctions which are not in thermal equilibrium and show how the quasi 
Fermi levels may be used in analyzing the processes involved. 

As an additional example of the behavior of <p n , <p p and ^ we consider a 
long, uniform rod of semiconductor carrying a current in the ^-direction. 
We assume that ohmic contacts are made at the ends so that the densities 
of the holes and electrons are unperturbed. For this case, the differ- 
ences $ — <p n and \f/ — <p p , which determine the densities n and p, are inde- 
pendent of position and have the same values as for the thermal equilib- 
rium state, for which <p n =» <p p = <p. The plot of <p n , <p p and ^ versus dis- 
tance is the same as for the equilibrium case except that a potential —E x x 
will be added to <p n , <p p , and ^. Equations (20) and (21) for the current 
densities will reduce to the conventional form I x = (qii p p + qi* n n)E x 
and the change in electrostatic potential, or voltage, and changes in quasi 
Fermi levels will be equal. 
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12.5 THE THEORY OF A p-n JUNCTION 

In this section we shall illustrate the use of the quasi Fermi levels in 
analysis and at the same time give a treatment of a simplified model of a 
p-n junction. 1 Figure 12.3 represents the distribution of <p p , <p n , and \p in 
a p-n junction under equilibrium conditions and also biassed in the forward 
direction by a potential 8<p, which is represented as if applied to the ^-region 
on the left while the right side of the w-region is held at constant potential. 
Under these conditions holes are injected into the w-region, and electrons 
into the p-region. We suppose that the uniform regions are so wide that 
injected carriers recombine before reaching the positions x a and x b . As a 
consequence, the total current 2 / is carried almost entirely by holes at the 
left edge of the figure and almost entirely by electrons at the right edge. 
The hole current I p must thus equal / at x = x a and must gradually drop 
to zero at x = x b , while I n = I — I p correspondingly increases. The 
mechanism of change in current type can be understood by considering the 
hole current flowing past xm- This current decreases because the holes 
have a lifetime t p in the w-region and combine with electrons so that the 
hole current attenuates as x increases; a similar effect takes place for the 
electron current entering the p-region. We shall assume in the analysis 
below that the transition region is so narrow that recombination in it is 
negligible; consequently, I p has the same value at xt p and xm- 

At x a and x b , the hole and electron densities thus have the values char- 
acteristic of uniform material and consequently at x a , for example, <p p and 
<p n are equal and differ from \f/ by the same amount as for the equilibrium 
case. This explains why the figure shows 

<Pp(x a ) = <Pn(Xa) = <PpM + 8<(> = <f>„(x b ) + 8<p (1) 

where 8<p is the applied potential and why 8\[/ = 5<p p = 8<p n at x = x a and 
ty = at x = x b . We shall next suppose that 8<p is small compared to 
kT/q so that only small disturbances in n and p are produced by the 
applied potentials. 

On the basis of the foregoing assumptions we shall prove the important 
result for p-n junctions that <p p is nearly constant in the p-region and drops by 
8ip in the n-region while <p n is nearly constant in the n-region and changes by 
8<p in the p-region. To prove this statement we shall suppose that the two 
homogeneous regions have conductivities substantially larger than intrinsic, 
so that the densities of the less abundant carriers, denoted by n p and p n , 
are very much smaller than the more abundant carriers p p and n n . This 

1 The treatment follows that of W. Shockley, Bell. Syst. Tech. J. 28, 435-489 (1949) but 
omits the more detailed considerations. See this article and Section 4.2 for additional 
references. 

2 In the analytic development below, the scalar symbols /, I p , and I n will represent current 
densities in the ^-direction. 
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relationship follows from 

n p = nf/pp and p n = n { 2 /n n . (2) 

[See Section 12.2 and equation (11) of Section 12.4 for details.] We shall 
next obtain the ratio of V<p v at Xr P and xr n in terms of these densities by 
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Fig. 12-3— Simplified Model of a p-n Junction. 

(a) Distribution of donors and acceptors. 

(b) Potentials for thermal equilibrium. 

(c) Effect of a potential 8<p applied in the forward direction. 



(c) 



using the expression for current density (20) of Section 12.4. At xt p we 
obtain 



I P = -qiippi 



d<p p (x Tp ) 
dx 



(3a) 
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Since we neglect recombination in the transition region, this same current 
flows at Xf n and we have 

I v = —quppn -. ' V°) 



From these we obtain 

d<p p (xTn)/dx _ pp _ p p n„ . . 

d<p p (x Tp )/dx p„ n? 

From (3c) we conclude that the change in ip p is much larger in the w-region 
than in the p-region, at least in the neighborhood of the junction. Similar 
arguments apply to <p„. Consequently Vp™ and V<p p will have to be very 
large wherever I p is appreciable in the w-region or /„ is appreciable in the 
p-region. Hence we conclude that the important changes in <p p and <p n 
occur as shown in Figure 12.3, where injected currents flow. 

Another way of stating this significant result is that the hole current is 
carried by material of high hole conductivity in the p-region and of low hole 
conductivity in the n-region; hence the gradient must be much greater in the 
^-region. 

The last statement brings out an important point: The resistance of a 
p-n junction is much greater than the integrated resistivity of the material 
composing the junction. This point follows from the fact that the hole 
current, for example, must flow a certain distance in regions where the 
density of the carriers is much less than w,, and in such a region V<p P is 
much greater than it would be even in intrinsic material. To be more 
specific, the junction resistance may be much greater than the resistance of 
a layer of intrinsic material as thick as the transition region. 3 

Until the injected carrier density becomes comparable with the density 
normally present, the arguments presented previously will be valid, and the 
important changes in <p p and <p n will occur in the regions in which the cor- 
responding carriers are present only in relatively small densities. In the 
p-region, p itself is nearly constant and equal to p p ; this follows from the 
condition for electrical neutrality which requires that the changes in p be 
equal to the changes in n produced by injection, and these latter are small 
compared to p p . Hence by equation (19) of Section 12.4, \p as well as 
(pp is nearly constant in the p-region ; similarly, ^ and <p n are nearly constant 
in the w-region. 

The purpose of establishing the conclusions just discussed, which are 
illustrated by the curves for xf/, <p v , and <p n in Figure 12.3, is to obtain a set 
of boundary conditions which can be applied to hole flow into the w-region 
across the plane at x Tn - Similar reasoning may be used to obtain expres- 
sions for electron flow into the p-region across xt p . In carrying out the 

3 See Shockley, lot. «'/., for a general analysis of this subject. 
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analysis we shall assume that all quantities are independent of y and z. 
As a consequence, all currents flow parallel to the .v-axis, and we need con- 
sider only the ^-component. For brevity we shall continue to omit the 
subscript x and replace 

I P x by I p (4) 

in this section. We shall neglect the contribution of recombination in the 
transition region itself to the conversion, as x increases, of I p to /„, the 
transition region being assumed very narrow compared to other dimensions 
such as the diffusion lengths L p and L n which will be discussed; conse- 
quently, I p may be taken to have the same value from x — xt v to x = xm- 
On the basis of this assumption we may take the total conduction current 
per unit area across x = to be 

I = In{XTp) + Ip(XTn)- (5) 

We shall calculate Ipixm) using the boundary conditions already discussed, 
and obtain I n {.«Tp) by analogy. In the discussion, the description in words 
will be given in terms of forward current, dip > 0; the equations, however, 
will be equally applicable to Sip < 0, the case of reverse currents. 

At the right-hand boundary, Xm, of the transition region, the relationship 

<Pp = <Pn + Sip (6) 

will hold. For convenience we repeat the following definitions of quantities 
in the K-region: 

n n = Nd — N a , normal density of electrons; 
Pn = ni 2 /n n «: n n , normal density of holes; 
t p , lifetime of a hole in the «-region. 

Since we have assumed that the injected carrier density is small compared 
to the type normally present, the boundary conditions at Xm are 

n{x Tn ) = n n = n i e q ^-^ ) l kT (7a) 

p(x Tn ) = n t *«*r-+>l kT . (7b) 

Replacing exp ( — qij//kT) in (7b) by («»■/»„), exp ( — q<p n /kT) obtained from 
(7a) gives 

p(x Tn ) - (n i 2 /fi n )e"^-^ lkT = p n e^ lkT . (8) 

Equation (8) is the key equation of the rectification theory. It states 
that the hole density at xr n varies exponentially with the applied voltage. 
From the linear theory, which we will give in detail soon, it follows that 
the hole current must vary in proportion to the deviation of p from its 
equilibrium value and hence as p — p n = p n [exp (qhip/kT) — 1]. This 
leads at once to the correct form for the d-c characteristic given in (22). 



12.5] HOLE DENSITY IN THE n-REGION 313 

We shall suppose that 5<p consists of d-c and a-c components so that 

kp = V Q + v 1& iut (9) 

where »i is an a-c signal, assumed so small that linear theory may be 
employed (that is, Vi -C kT/q). Expanding (8) in series gives 

p(*Tn) = ij>n^ V " kT )[\ + {qvJkTW 



ii«£l 



We resolve this density into its equilibrium value p n , a d-c component po, 
and an a-c component pi exp (iwt) : 

p(x Tn ) = p n +p + Ptf™' (10) 

where 

Po = Pn{e qY ° lkT - 1) (11) 

Pi = (gPnVi/kT)e" nikT . (12) 

So long as p(xm) "^ tin, the normal density of electrons in the w-region, the 
lifetime t p , and diffusion constant D p fo>- a hole will be substantially 
unaltered by S<p. 

Our problem is to determine the value of p (x, t) for x > x? n and from 
this to obtain the resultant hole current Ipixm, t) flowing across xm- 
The total current /(/) is the sum of this hole current, a similar electron 
current, and a displacement current D/4ir = (k/4t)E or D = kc E in 
M.K.S. units. For p-n junctions the displacement current is, in general, 
small compared to the conduction current except at high reverse biases or 
low temperatures. 4 

As discussed previously, the hole current injected into the ^-region dies 
away with a relaxation time, or hole lifetime, r p . In accordance with the 
discussion of Section 12.2, equation (15), the continuity equation for this 
case is 

ft.fc^_l 7 . t (13) 

dt t p q 

Since \j/ is assumed substantially constant compared to <p p , the current 
density is largely diffusion current with a value of 

I p =-qD p ^- (14) 

ox 

The insertion of this in equation (13) leads to 

f !_t-=l + Dr &. (1!) 

dt t p dx 

4 See W. Shockley, Bell Syst. Tech. J. 28, 435-489 (1949), Section 4.4. 
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The solution of this equation subject to the boundary conditions at xm is 
P =pn + po<T*' IL > + pie™ 1 '* < 1 + toT »> M ' L P (16) 

where 

x' = x — xm, x' = at * = xm (17) 



L p = Vd v t p = diffusion length for holes in w-region. (18) 

The physical meaning of the diffusion length is as follows: Suppose at 
some value of x = xi the hole density is p{x{) and the electric field is 
negligible. Then the d-c part of equation (16) shows that, if there are no 
hole sources to the right of x u p attenuates by a factor e _1 in each interval 
L p . Some additional interpretations will be found in the problems. 

The condition that V^ is negligible compared to Wp, so that diffusion 
current dominates, can be given a simple interpretation if p » p n . For 
this condition we may write p = p(#i) exp [— (x — Xi)/L p ] which leads to 

_ kT(x- Xl ) + kT^ [p{xi)/ni] + ^ (19) 

qLp q 

according to the definition of <p p of equation (19) of Section 12.4. The 
diffusion term will dominate the current if in 

*?* = _ KL + d l (20) 

dx qL p dx 

the term kT/qL p is much greater than d-^/dx. (See problems.) 
The solution just obtained for p gives rise to a current per unit area at 

XTn of 

T, dp 
= qPoD p /Lp + qPiD^'O. + iwrp^/Lp. (21) 

The d-c part is obtained by substituting (11) for p : 

I P o(*Tn) = (qp n D p /L p )(e" r ' lkT - 1); 

s 7 p ,(e« F,/ * r - 1). (22) 

The forward direction corresponds to F > and for it the current in- 
creases exponentially. In the reverse direction, the exponential term 
vanishes and I p0 approaches a saturation current (qp n D p /L p ) = I ps . 
The comparison of equation (22), or rather its generalization (33), with 
experiment is discussed in Section 4.2 and at the end of this section. 

The a-c part is similarly obtained from the pi term of (21) combined 
with (12): 

Ipl(XTn) = {qPnV. P /L p )& V °l kT {\ + torque'"' 

s (G v + is,)!*™ s A v v^ 1 (23) 
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where A p is called the admittance (per unit area) for holes diffusing into 
the w-region ; its real part G p and imaginary part S p are the conductance 
and susceptance. For wt p small, the real term G p is simply the conductance 
per unit area of a layer L p thick with conductance 

qp n e" y " kT n p /L p = q(p n + p )» P /L p ; (24) 

for the case of zero bias this confirms the result stated earlier that the 
resistance to hole current flow arises from the low hole conductivity 
qpnVp of the ^-region. Equation (24) is also the differential conductance 
obtained by differentiating the d-c characteristic (22) with respect to F . 

In this section we have treated t p as arising from body recombination. 
In a sample whose y and z dimensions are comparable to L p or L n , surface 
recombination may play a dominant role. It can be shown that the same 
theory can be applied to a good approximation in this case, provided that 
for t p we use the actual lifetime in the sample. 5 We shall discuss lifetimes 
in filaments in the next section. 

Precisely similar expressions may be derived for electron currents. 

The values of the coefficients qp„n P /L p and qn p y. n /L n which depend on 
the properties of the material may be expressed in terms of the conduc- 
tivity <r„ of the »-region, <r p of the ^-region, a the conductivity of intrinsic 
material, and the ratio 

b = n„/np. (25) 

In terms of these quantities we have 

a = qn p ni(\ 4- b), (26) 

and if p n <£L n n and n p -C p p , we find 

qn P p n = b<n 2 /(l + b) 2 a n (27) 

qn n n p = b<n 2 /(l + bfa p . (28) 

Using these equations, we may rewrite (22) and a corresponding equation 
for electron current into the ^-region so as to express their dependence on 
d-c bias V§ and the properties of the regions: 

/,oCo) - n ,y, T --(e^ r - 1) 
(1 + by<; n L p q 

-G p0 — (e« v " kT -\) 

q 

- I p .(e qT,lkT - 1) (29) 

6 See W. Shockley, loc. cit., Appendix V. 



316 APPLICATIONS TO TRANSISTOR ELECTRONICS [12.5 

Inoifo) - (1 + hfapLn q (e 1) 

1 
- /».(e« r ''* r - 1). (30) 

The values of G p0 and G n0 [which are readily seen to be the values of the 
low-frequency, low- voltage (V <§; kT/q) conductances] and the saturation 
reverse currents are given by 

be? _ q 

(1 + b) (x n Lp kl 

r - ha ? = J- t (2i\ 

(1 + by<r p L n kl 
The expression for direct current then becomes 

h{Vo) = (G p0 + G n0 ) — (e qV ° lkT ~ 1) 

= (I P s + In S )(e* r ° lkT -l). (33) 

This is the expression plotted in Figure 4.6 and compared with data there. 
In terms of G p0 , the complex admittance per unit area for hole flow may 
be written in the form 

A v = G p + iS p = (1 + iuT p )*G p0 & v °' hT . (34) 

For low frequencies, such that o> is much less than 1/t p , we can expand 
G p + iS p as follows: 

G p + iS p = G p0 e« v ° !kT + i u ( Tp /2)G p0 e« v ° lkT . (35) 

Hence {r p /2)G p0 exp {qV Q /kT) behaves like a capacitance per unit area. 
It is instructive to interpret this capacitance for the case of zero bias, 
F = 0, for which we find: 

C p = t p G p0 /2 = T p qp n y. p /2Lp = q 2 p n L p /2kT. (36) 

The last formula, obtained by noting that r p ix p = qr p Dp/kT = qL p /kT, 
has a simple interpretation: qpJLp is the total charge of holes in a layer L v 
thick. For a small change in voltage v, this density should change by a 
fraction qv/kT so that the change in charge divided by the change in v is 
(q/kT)(qp„L p ) which differs from C p only by a factor of 2, which arises 
from the nature of the diffusion equation. 

Although A p simulates a conductance and capacitance in parallel at low 
frequencies, its high-frequency behavior is quite different. In Figure 12.4 
the behavior of (1 + /wt) h = J P /G p0 , is shown. For high frequencies 
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G p and S p are equal: 



G p — Sp 



V7p~/2G v0 VZ 



ba? VZ 



(1 + i>) 2 a n V2D 
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(37) 



Thus for high frequencies the admittance is independent of t p and is 
determined by the diffusion of holes in and out of the w-region. The three 
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Fig. 12-4 — Real and Imaginary Parts, G and S, of the Admittance for Hole Flow 

into the »-Region. 

(a) Shows 10 3 Jp/G p0 = 10 3 (1 + iar P ) y2 , and corresponds to equation (34). 

(b), (c) and (d) correspond to cases in which the w-region has varying prop- 
erties.* 

* See W. Shockley Bell Syst. Tech. J. 28, (1939), Figure 7 and associated text. 



straight asymptotes have a common intersection at the point G p0 , oit = 2 
on Figure 12.4, a fact which is useful in estimating the value of r from such 
data. 

The values of I ps + I ns deduced from the known conductivities of the 
material composing the junction and values of t, which is a sort of average 
of t p and t„, are found to be in general agreement with the values found from 
the d-c rectification curve within limits imposed by the approximations of 
the model. At the time of writing, experiments were being undertaken to 
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determine L p and L n directly by photoconductive experiments with a view 
to obtaining separately the I p and I n components. The fit for the rectifica- 
tion formula itself, equation (33), is discussed in connection with Figure 4.6. 6 

12.6 LIFETIME IN FILAMENTS, AND THE SUHL EFFECT 

12.6a. Introduction. A number of experiments have shown that the 
recombination process in germanium filaments often takes place largely on 
the surface and that this recombination may be enhanced by applying 
simultaneously a longitudinal electric field and a transverse magnetic 
field. This latter effect arises from the increased recombination on the 
surface due to the density there of the injected carriers, the Suhl effect 
discussed in Section 3.2. 

In this section we shall solve the continuity equation for a rectangular 
filament for the case in which a longitudinal field is present and show how 
surface and body recombination enter into the equations for lifetime and 
decay constant (= 1 /lifetime). We shall treat only certain limited 
aspects of the Suhl effect because the analysis, although straightforward, 
is somewhat lengthy. 

Throughout this section we shall deal with the case of holes injected into 

n-type material in small densities. For this case arguments similar to those 

of the last section apply, and we conclude that, since the changes in n are 

small, 

kT 
ty — <Pn — — In n n /ni — constant. (1) 

1 

The transverse currents in the filament are equal and opposite for holes 
and electrons, since otherwise a net change would build up on the surface; 
consequently, the gradients of ip n and <p v must be in the ratio of p to n 
[see equations (20) and (21) of Section (12.4)]. Accordingly, <p n and ^ 
must change by negligible amounts across the filament, and the hole cur- 
rent must flow largely by diffusion. 

This same conclusion may be reached less analytically as follows: Sup- 
pose a few holes are injected. They will set up a slight electric field. 
This field will draw in electrons to neutralize the charge of the holes. 
However, the electron density is much larger than the hole density so that 
the required field need alter the electron distribution only very slightly. 
The field must, therefore, be so small that the electrons move almost as if 
no holes were present at all. Hence to a first approximation the holes can 
be considered to move in the same field as would exist if no holes were 
present. The effect of the electrons, so far as the holes are concerned, is 
simply to neutralize the charge density of the holes so that they are shielded 
from the mutual repulsion of their space charge. 

We shall consider the effect of surface recombination upon the lifetime 

6 These results were planned for publication, probably in the Physical Review, for late 1950. 
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for two cases. In the first case we shall imagine that the distribution along 
the filament is uniform, a situation which might be produced by illumi- 
nating the filament by a flash of X rays, for example. After this, we shall 
show that the hole density decays as 

exp (— t/rpi), r P i — lifetime in filament, (2) 

independently of the presence of an electric field. The lifetime is by 
definition t p \ or the time required for decay by a factor of e — l to take place. 
In the second case, we shall imagine that holes are being steadily in- 
jected at some point and are drawn down the filament, decaying by recom- 
bination as they go. For this case an attenuation constant a will be 
introduced, the hole density varying as 

e~ ax , a = attenuation constant. (3) 

We shall define a lifetime as the time required for a hole to drift under the 
field a distance \/a corresponding to a decay of e _1 in the density. This 
procedure leads to 

t P 2 = (l/a)/n p E x = \/an p E x . (4) 

We shall show that, for large electric fields, so that motion along the fila- 
ment arises chiefly from E x , the values of t P 2 and r v \ are equal. We shall 
find it less convenient to deal with the lifetimes than with the decay con- 
stants, defined as 1/t p i and 1/t p2 . In terms of decay constants, the effects 
of surface and volume recombination are found to be additive so that we 
may write 

— = - + .-. (5) 

where 1/V P is the volume decay constant and v s is the surface decay con- 
stant. 

12.6b. Continuity Equation and Boundaiy Conditions. We are, there- 
fore, led to investigating the solution of the continuity equation [(15) of 
Section 12.2] for holes in a germanium filament in the presence of a uni- 
form electric field, which we take to lie in the ^-direction, along the fila- 
ment. We shall take the boundaries to lie at y = ±5 and z = ±C so 
that the cross section is rectangular with area 4BC. The hole current has 
components 

dp 
h = qvppE x — qDp ■— (6a) 

ox 

Iy=-^vT (6b) 



by 
dp 
Hz 



I z =-qD p ^f- (6c) 
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These must satisfy the continuity equation 



dt t p q 

Pn - P „ dp , j. /d 2 p . d 2 p , d 2 p 



r p ->> E *7 X + D »(^ + ~P + W' (?) 

Evidently p = p n is the solution of this equation corresponding to i>o 
injected holes. We shall introduce the symbol 

Pi= P ~ Pn (8) 

to represent the disturbance due to hole injection. 

In terms of pi, equation (7) is a homogeneous linear partial differential 
equation since each term involves pi linearly. Furthermore, it is sep- 
arable 1 and has solutions that can be written as products of separate 
functions of /, x, y and z. One such solution is 

pi = Q-'t-a* cos fry cos cz (9) 

where in order to satisfy (7) the constants v, a, b and c must satisfy the 
relationship 

v - - + H P E x a + D p (a 2 - b 2 - c 2 ) = 0. (10) 

Tp 

The most general solution of (7) corresponding to arbitrary initial condi- 
tions at t = can be expressed as the sum of a series of terms of the form 
of (9), in general including sine terms as well as cosine terms for y and 3 
and complex or imaginary values for a. 

For the t p \ case described above, if the filament is illuminated equally on 
all sides so that the same value of pi is produced on all sides, only the 
symmetrical cosine terms will be needed. A number of such terms will be 
needed, however, in order that their sum at t = will represent the hole 
density. Each term in the series will thereafter decay with its own 
characteristic lifetime so that the decay of the total number of holes in the 
filament will not be represented by a simple exponential decay but by a 
sum of decaying terms having different lifetimes. This reasoning leads 
naturally to the question: What is meant by the lifetime in a filament 
when the decay is made up of a mixture of lifetimes ? 

The answer is that one of the solutions of the form (9) has a lifetime so 
much longer than the others that after the others have decayed to negligible 
proportions it will still persist. It is this longest lifetime with which we 
shall be chiefly concerned. The other terms can be considered to be initial 

1 A solution can be written as the product of separate functions of the variables, as may be 
seen from the following development. 
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transients which die away rapidly leaving only the slowest decay to be 
observed. 

A similar argument applies to the t P 2 case. For this case we may 
imagine that at x = the hole current distribution in the cross-section of 
the filament requires a superposition of many terms of the form cos by cos cz 
and sine terms as well. All but one of these decay very rapidly as the holes 
drift down the filament under the field and the remaining one, as we shall 
show, gives the same value for lifetime as t p \. 

[The problem of expanding an arbitrary function of x,y and 2 in a series 
of eigenj 'unctions satisfying (7) and the boundary conditions on the surface 
is similar to many other problems of mathematical physics. 2 A detailed 
solution for flow across a p-n junction into a square filament has been 
published. 3 ] 

At the boundaries y = ±5 we shall suppose that the excess of recom- 
bination over generation is directly proportional to the disturbance pi 
with proportionality constant s. Consequently, the current toward the 
surface satisfies 

{\/q)I v = ±s Pl y = ±B (11a) 

{\/q)h = ±spi z = ±C. (lib) 

The quantity s has the dimension of a velocity or length per unit time. 
We shall call it the surface recombination velocity. Equation (11a) states 
that the rate of recombination is the same as if a current of holes of den- 
sity pi were drifting with an average velocity j into the surface and being 
removed. 

The boundary condition for B and —B are equivalent when pi is given 
by (9); and (11a) and (lib) reduce to 

bB tan bB = sB/D p = x v = V tan r\ (12a) 

cC tan cC = sC/D p = X z = f tan f (12b) 

where we have introduced the symbols 

■q ^ bB and f s cC. (13) 

For any value of Xy there is one solution for -q in the range < rj < x/2 
and also one solution in every interval of the form nir to (n + \)ir and the 
same is true for Xz and f. The term having the longest lifetime, as we shall 
show below, is that corresponding to the smallest solutions, denoted by ?;o 
and fo> of (12a) and (12b). In terms of these b and c become r] /B and 

2 See Section 14.5 for an analogous quantum mechanical case and the following references, 
all from McGraw-Hill Book Co., Inc.: L. I. Schiff, Quantum Mechanics, 1949, Section 10; 
P. M. Morse, Vibration and Sound, 1936; E. C. Kemble, The Fundamental Principles of Quan- 
tum Mechanics, 1937. 

3 W. Shockley, Bell Syst. Tech J., 28, 435^89 (1949), Appendix V. 
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fo/C respectively. Equation (10) may then be regarded as an equation 
in two undetermined parameters a and v and may be rewritten with these 
terms gathered together in the form 



v + D p a 2 + n P E x a \- D 



p \B 2 + (?) 



"/• 



(14) 



The terms on the right of the first equals sign are determined by the con- 
stants of the filament, i.e., t p , B, C and s. We shall show below that this 
combination, denoted by v/, is the decay constant for the filament and is 
equal to l/r p i and 1/V P 2- Because of this equality we may introduce a 
filament lifetime 

filament lifetime = 77 = l/v/. (14b) 
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Fig. 12-5 — Variation of the Density of Injected Holes across a Filament. 

In Figure 12.5 we show the distribution of injected hole density across 
the filament for the rj , fo solution on the line z = 0. This distribution 
shows how pi varies with y for three values of %• F° r s = 0, the distribu- 
tion is uniform since no diffusion to the surface is required to balance sur- 
face recombination. For s = «>, the hole density is zero at the surface, 
and a high diffusion gradient produces a flow of holes to the surface where 
they recombine. For an intermediate value of x> the value of pi is finite 
at the surface, and the gradient is just large enough to account for re- 
combination. 

We shall discuss the t p \ case first, for which the density is independent of 
x so that a = 0. For this case equation (14) reduces to 

(15) 



V = Vf 

. The solution pi can then be written as 
' Vfl cos by cos cz (16) 



independently of the value of E x 

Pi = e 

so that the hole density decays exponentially with a decay constant Vf 
which is equal to 1/t p i in accordance with equation (2) so that 



J_ 

Tpl 



1 



= Vf 



+ 



Up \B 2 + (?) 



1 



+ y. 



(17) 
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This equation shows that the decay constant Vf for the filament is the sum 
of the "bulk material" decay constant \/r p plus a surface decay constant 
v e defined by equation (17). 

We can now see why terms involving tj b and f„ with n greater than zero 
can be disregarded for many purposes. For these terms the values of 
t\n and f„ 2 will be at least four times larger than for ij 2 and fo 2 ; conse- 
quently, such terms will decay rapidly compared to the tj , fo term and will 
become negligible in comparison with it, at least in all cases where v a is 
important. The distribution of hole density across the filament for 
these higher order terms is alternating in sign and the rapid decay is a 
result of flow of excess holes in regions of hole deficit. The distribution of 
hole density for the case s = °°, corresponding to p = at the boundary, 
is identical with the displacement of the rectangular drum head of Figure 
5.2. Part (f) corresponds to the j/ofo term. The other parts of the figure 
correspond to using sine as well as cosine functions in pi and have values of 
i\ and f for the sine functions which satisfy equations of the form ij cot t\ 
= x- We shall neglect these higher order terms in the following 
discussion. 

The values of v, for the rj , f terms is 

'•- D >[P + £| (18) 

and this is a function of D p , B, C, and s with s entering through the depend- 
ence of )7o and fo upon %y — sB/D P and xz — sC/D p . Simple forms are 
obtained for large and small values of s. For s — > a> , both rjo and fo become 
7r/2 corresponding to pi =0 on the boundary. For this case 

x 2 D p /l 1\ 

''--rU^cy' ' • (19a) 

For small values of s, equations (12) and (18) reduce so that v s again takes 
a simple form [see Problems for a physical argument that also leads to 
(19b)]: 



* = S (l + £)' 



•0. (19b) 



For a filament of square cross section v s can be expressed as a function of 
the three variables s, B, and D p by a single curve in terms of the following 
equation in which voixy) is the solution of (12a): 

V ° B " = 2[ V obcy)] 2 = 2[r, (sB/D p )] 2 . (20) 



D p 



This relationship is plotted in Figure 12.6. 

Figure 12.6 may be used in combination with equation (17) to cal- 
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culate values of t/ from known dimensions B, body lifetime t p , and sur- 
face recombination constant s. It may also be used to determine s from 
known values of body lifetime t p and filament lifetime r/. Experiments 
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Fig. 12-6 — The Dependence of the Surface Decay Constant v s upon the Surface 

Recombination Velocity s. 

af this sort have been carried out by a number of workers and approximate 
values for s, obtained for various surface treatments of germanium, are as 
shown in Table 12.1. 

Table 12.1 Surface Recombination Constants for Germanium 



sand blasted surface 
etched" 



~10 4 cm/sec 
~2000 cm/sec 
~200 cm/ sec 



a Details of these treatments are planned for publication in 1950-51. 

We shall now show that the same value of t/ is obtained by studying 
the dependence of hole density upon distance along a filament in the pres- 
ence of a high electric field and by converting this distance to time using the 
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drift velocity E x n P = v x . For this case the solution is a steady-state solu- 
tion independent of time so that v in equation (14) should be set equal to 



zero. 



The value o'i a is then found to be 

- v x ±Vc x 2 + AD v v f 



a = 



2D P 



(21) 



The solution of interest corresponds to the plus sign; the minus sign cor- 
responds to decay towards the left (— ^-direction) or diffusion against the 
field. (See Problems.) For large values of v x 2 /D p v/, such as are encoun- 
tered in filaments used experimentally, we obtain by expanding the radical 
of equation (21) in series or by neglecting the D p term in (14) 

a = vf/v x (22) 

which leads to the following value of t v2 , as defined by equation (4) : 

t p2 = l/av x = l/vf = Tf. (23) 

Thus we find that t p2 , as well as r pl , is equal to 77 as defined in equation 
(14). 
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HOLE DENSITY, p, 

Fig. 12-7 — Concentration of Holes near the Surface by the Suhl Effect. 

12.6c. The Suhl Effect. We shall next consider the effect of combined 
electric and magnetic fields upon the lifetime of injected holes. For this 
case we shall again neglect the influence of the injected holes upon the 
electric field; there are, however, two differences from the case treated 
previously. In this case there is a transverse field due to the Hall effect. 
We shall restrict the treatment to small values of H z so that only linear 
terms need be considered. The transverse field E y is then equal to —B n E x 
for the small H z approximation discussed in Chapter 8. Furthermore, 
for the coordinate system shown in Figure 12.7, the drift velocity is turned 
at a Hall angle 6 P from E. Finally, the diffusion current is turned at an 
angle 6 P in respect to the density gradient. [See Section 12.3, equation 
(11).] For the case treated below, the rotation of the diffusion current 
produces an effect dependent on cos (d p + n ). For the small angle 
approximation used here, this effect is negligible. 

If E y is large enough, the field across the filament will be so strong that 
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substantially all the holes will be concentrated on one side. We shall 
restrict the treatment to this case, referred to as the strong Suhl effect, and 
shall choose the origin of thejy-coordinate as the side of the filament towards 
which holes are deflected. This is the case shown in Figure 12.7. 

We shall for brevity restrict the treatment to the case analogous to 
T p i and assume that^i depends only on_v and t; the neglect of z corresponds 
to increasing the z-dimension indefinitely. These assumptions permit us to 
neglect the ^-component of the hole current since this component is inde- 
pendent of x and does not contribute to the divergence. The^-component 
of hole current is then 

Iy(y) = -q» P PlE x (d n + B p ) - qD p ^ ; (24) 

dy 

and at v = 0, corresponding to the lower boundary towards which holes 
are driven, this must satisfy the boundary condition 

-J v (0) = gs Pl (0). (25) 

We shall show that there is a solution for the injected hole density of the 
form 

Pl = e-'i-by (26) 

which attenuates in the +jy-direction, as shown in Figure 12.7, and repre- 
sents hole density near one surface of the filament. In order to abbreviate 
the algebra and bring out the physics we introduce the symbol Vt where 

v t = iipE x (6 p + 6 n ) = ix v E t . (27) 

v t is the transverse drift velocity which a hole would have in the field of 
Figure 12.7. E t is thus an "effective transverse field". Substituting p\ and 
v t into the preceding equations leads to the following expression for the 
boundary condition (25): 

v t -D p b = s (28) 

or 

*=(*,- s)/D p . (29) 

If v t > s, the holes tend to accumulate near y = since they are brought 
down faster than they can recombine; we shall treat this case which repre- 
sents a strong Suhl effect; cases with v t = s and v t < s represent transi- 
tion cases between H z = and H z large and have a more complicated 
behavior. 

The density gradient in Figure 12.7 tends to make holes diffuse away 
from the surface and is thus in contrast with Figure 12.5 in which holes 
diffuse to the surface. The situation of Figure 12.7 occurs because we 
have taken v t > s. The quantity s has the dimensions of a velocity and, 
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for v t > s, holes are swept towards the surface by E t so rapidly that they 
tend to accumulate and set up a back diffusion current. 

The value of v is determined by inserting the expression (26) for pi into 
the continuity equation (15) of Section 12.2: 

|L = _^_I v . /pj ( 30) 

at t p q 

which then reduces to 

v=- + v t b- D p b 2 =- + (*, -D p b)b 

Tp Tp 

= 1 + si s - + „.. (31) 

Tp Tp 

The contribution v s = st of the surface recombination to the decay con- 
stant is thus seen to be directly proportional to b, which, in turn, increases 
rapidly with 

E t = {dp + n )E x = 10- 8 (hhp + HHn)H z E x , (32) 

in the notation of Section 8.8 for practical units. 

The relationship v s = sb of equation (31) has a relatively simple inter- 
pretation. The solution 

Pl = e- vt ~ bv (33) 

leads to a total number of holes per unit area of surface on the xz plane of 

•2B e -"« 

-»*-*» dy = — (1 - e~ 2B6 ). (34) 

b 



I 



The quantity 2Bb is an effective transverse potential due to the magnetic 
field measured in units of kT/q. For v t S* s it may be written as 

IBb = IBupEJBp = 2qBEt/kT. (35) 

For realizable experimental conditions, the contribution of the Vt term to 
2bB may be so large that the term exp ( — 2Bb) is negligible. [If exp 
( — 2Bb) is not negligible, the effect of the boundary at y = IB must be 
included.] The rate of recombination on the surface is 

pts = se- vt (36) 

and this is simply sb times the number of holes given by (34). Since the 
decay constant is the ratio of rate of decrease of holes to total number, this 
shows that sb = v s . 

To put the same result in simpler language: For strong transverse fields, 
the holes flow in a layer with an effective thickness of kT/qE t == \/b so that 
the total number per unit area is pi(0) X (1/b) and the rate of recombina- 
tion is p\(0) X s and the decay constant is s 4- (l/b) — sb. 
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The treatment just presented corresponds to the t v \ treatment of 
Section 12.6a since for it pi was independent of x. 

A treatment like that given to t P 2 can be carried out also when magnetic 
fields are present. The answer is again that r v \ = t P 2, provided that 
d p and d n may be treated as small angles. When they are large, magneto- 
resistance effects set in and the complete mobility and diffusion tensors of 
Section 12.3 must be employed. 

Theories of the sort discussed here have been applied in evaluating the 
surface recombination constant s as discussed in Section 3.2. 4 

12.7 HERRING'S SOLUTION FOR LARGE, INJECTED 
CARRIER DENSITIES 

The formulation of the continuity equation of Section 12.2, the Einstein 
relationship, and the equations for the quasi Fermi levels are applicable to 
hole and electron densities of arbitrary magnitudes (provided degeneracy 
is not encountered). In the applications presented in Sections 12.5 and 
12.6, however, only hole densities which were very small compared to 
electron densities were considered. Under these conditions, the differential 
equations for p were linear, and separable and simple solutions could be 
obtained including the effects of recombination, diffusion, and applied 
electric and magnetic fields. When the hole density is comparable to the 
electron density, the equations become nonlinear and it is much more 
difficult to obtain solutions and to visualize the processes involved. 

An elegant solution for the nonlinear problem has been obtained by 
C. Herring 1 for the case of large-amplitude disturbances of the densities. 
We shall discuss his simplest case which is subject to the following 
assumptions: 

I. n = p + n e = p + N d - N a (1) 

which implies that the net charge density is zero and that the 
donors and acceptors are fully ionized. 
II. Recombination is negligible. 

III. Conditions are uniform in the y and z directions so that all quantities 
are functions of x and t. 

IV. Diffusion currents can be neglected compared to currents produced 
by the field E x . 

Assumption II puts the treatment of this section on a very different basis 
from those of the last two sections in which recombination plays a major 
role in causing the injected carrier concentrations to decay to their equi- 
librium values. For this section, the equilibrium values are without 

4 See H. Suhl and W. Shockley, Phys. Rev. 76, 180 (1949). 
1 C. Herring, Bell. Syst. Tech. J., 28, 401-427 (1949). 
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significance since Assumption II prevents an approach to them. This is 
the reason that in (1) we have used not n n but instead n e , subscript e for 
excess of donors over acceptors. Similarly the conductivity o-(O) = qix„n e 
is not the equilibrium conductivity but simply that corresponding to p = 0. 
(The limiting case of intrinsic material with n e — > leads to <r(0) =0 and 
E = oo but in such a way, as pointed out in a footnote, that the expression 
for the densities velocity is properly behaved.) 

Under these conditions the hole and electron currents are 

I p = quppE (2) 

7„ = qn n tiE, (3) 

where we have omitted the subscript x from the current densities and 
from E. The fraction of the current carried by holes is a function of p and 
can be written 2 as 

P 

np) i i n + i P r\ . /, . in pi 



-H i+ M 



where n has been eliminated by equation (1) and 

(These equations are discussed more fully in Section 3.1.) In terms of 
f(j>), the continuity equation [(IS) of Section 12.2] becomes 

b ± = _l v ./ = -l d JMl = _l^2l^P 

dt q P q dx q dp dx 

= -V{I,p) d f x (6) 

where the densities velocity V{I, p), defined by this equation, will be shown 
later to be the velocity with which the hole density propagates. V{J, p) 
may be written in the form 

/ dj{p) I M P 



V(I,p) = 



q dp qn n n e 



1 + 



MS 



(V) 



In some cases we shall consider I to be independent of time and shall write 
V simply as V{p). For small densities of injected holes, V(I,p) reduces 
to the drift velocity n P E Q of holes in a field 

E = I/qn el x n = 7/«r(0) (8) 

2 /(p). which is used only in this section, is not to be confused with the Fermi-Dirac func- 
tions f and i p . 
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corresponding to / in the «-type material. For large values of p, V{I,p) 
approaches zero. 

It is instructive to compare V(I, p) with the drift velocity v p = En p of 
a hole. E is obtained from the conductivity 



*(P) = ?W> + M»») = '(0) [* + (* + j)£\ 
and the current / by using the equation E — I/a so that 



(9) 



v p {p) = ,x p E = p /* ■ (10) 

qn e y. n ~ 



fc H'+i)S 



For p = 0, this equation, like equation (7), reduces to n p E . For larger 
values of p, we have the relationships 



«r(0) o p (0) 



v p (p) = 0,(0) -^ = F / W 1N _, (11a) 

<r{p) - 



hH)fJ 



which shows that the reduction factor for V(J,p) is the square of that 
for v p . 

Since n and p are related linearly according to equation (1), n may be 
determined directly from p and need not be considered separately. We 
shall also omit a discussion of electrons injected into p-type material since 
that case can be obtained from the one treated by interchanging subscripts 
and writing \/b in place of b. 

We shall next consider an extreme case corresponding to hole injection 
into intrinsic material. For this case we shall show, by direct physical 
arguments rather than by equation (7), that the hole density remains 
stationary in space although the drift velocity of holes does not vanish. 3 
Accordingly we suppose that the hole density is some function po(x) at 
/ = 0; since n e = 0, n also equals po(x) at / = 0. Thus for all values of 
x, the ratio of hole current to electron current is constant and has the 
value \/b, and consequently, the hole current is independent of x. For 
this condition, the net hole current flowing into one side of any interval 
8x is equal to the hole current flowing out of the other side, and the hols 
and electron densities remain constant although both hole and electroa 

3 After considering the meaning of V as discussed in connection with Figure 12.8, the 
reader should verify that the statement above corresponds to showing that as n n — > 0, V 
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flows occur. We may imagine a condition of this sort to be produced by 
producing photoconductivity by a pulse of light shining on a limited 
length of a filament in the intrinsic condition. The region of high density 
would then remain fixed in space while holes flowed through it in one 
direction and electrons in the other direction. 

The mathematical expression of the physical picture described in the 
preceding paragraph is that df{p)/dp = so that the same fraction of 
current is carried by holes at all values of x and consequently V • I P = 0. 
For small values of p in an »-type sample the hole density progresses with 
the hole drift velocity as we shall discuss. 

Actually diffusion and recombination would cause the region of high 
density to spread and decay, and the stationary situation already dis- 
cussed would represent only an approximation. However, it serves to 
illustrate that it is not the hole drift velocity which causes the distribution 
of density to shift but instead the divergence of hole current as required by 
the continuity equation. 

If, on the other hand, the hole density p is so small compared to n that 
the electric field is altered negligibly so far as hole motion is concerned, then 
the hole current is simply 

I v = E q» p p - qv p p (12) 

where v p is the drift velocity of a hole in the field E . For this case the 
continuity equation becomes 

dp dp 

— = ~ v pT> ( 13 ) 

dt dx 

which is solved by any function of x — v p t, 

p = po(x - Vpt), 

which represents translation of the hole density in the +#-direction with 
velocity v p . , 

The reader may verify that, (or p/n n -C 1, equations (6) and (7) reduce 
to (13), whereas for the intrinsic case they reduce to dp/dt = 0. 

For intermediate cases, Herring points out that a solution to (7) can be 
obtained by a geometrical construction which we shall illustrate in terms 
of a pulse as shown in Figure 12.8. In this figure the pulse has a hole 
density at / = represented by an isosceles trapezoid. In part (b), the 
density velocity is plotted as a function of p/n e on the same density scale 
as for part (a), it being assumed that / is constant in time. Herring's 
construction is as follows: Take any point in (a) for which p has the value 
pi; draw a horizontal line from this point with length V{p{)t; the point 
so obtained is a point on the translated pulse. That this process satisfies 



332 APPLICATIONS TO TRANSISTOR ELECTRONICS 

equation (7), 



dp 
dt 



ox 



[12.7 



(14) 



may be seen from (c) which shows that, in time dt, the increase in p at 
constant x is 



VP)* = W(p)dt\ 
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Fig. 12-8— Deformation of a Pulse of Holes Due to Conductivity Modulation. 

(Drawn for b = 2.) 

from which (7) is obtained by dividing by dt. A more general expression 
of the functional relationship between p, x, and t is 

*(P, = x(p, 0) + T V[I{t'), p]dt', (16) 

which is discussed further in the problems. 

It is to be noted that the increase in density velocity for low hole density 
causes the lower densities to go faster than the higher densities so that the 
trailing edge of the pulse becomes infinitely steep, a shock-wave phenome- 
non. As the steepness increases, assumption IV (that is, diffusion currents 
are negligible) becomes untenable and the steep trailing edge tends to 
widen owing to diffusion. As a result, the width of the trailing edge 
reaches a limiting value of the order of 



A* = 



IkT 
qE 



1 - 



1 



1 + 



H)^ 



(17) 



where £ max is the maximum of p at the top of the trailing edge. This 
equation, due to Herring, is obtained by setting the rate of widening due to 
diffusion equal to the rate of narrowing due to the difference in velocity 
between the limiting hole densities in the pulse. (See Problems.) 
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Herring has also considered the effect of recombination, which can be 
taken account of in a relatively simple way (see Problems) and diffusion. 
When nonlinear effects due to conductivity are combined with the diffusion 
mechanism, the problem becomes much more complicated. Steady-state 
solutions of the one-dimensional case in which the only approximation 
made is that of assuming negligible failure of the neutrality equation 
n = p + n e , assumption I, have been obtained by W. van Roosbroeck. 4 

12.8 BAND EDGE ENERGIES VERSUS VOLUME: S lp , 5 ln , S ie 

The method of deformation potentials discussed in Sections 11.3 and 17.6 
makes use of the quantities &i P and &i„, which were defined as the change 
in the band edges & v and & c per unit dilatation. These quantities enter 
the expressions for mobility through equations of the form 

M "~ 3 & lf ?tn»(kT)* (la) 

which was derived as equation (30) of Section 1 1.4. For n n in cm 2 /volt sec 
cu in dynes/cm 2 , electron volts for Sj„ and T in °K this becomes 

n n = 3.2 X lQr 6 c ll (m/m n ) H /6 ln 2 1^. (lb) 

As discussed in Section 11.3, equation (34), the relationship 

|S 1G | = |S ln | + |S lp | (2) 

should apply if the slopes of the band edges have opposite signs as shown in 
Figure 11.4, which has been drawn to agree with the theoretical band struc- 
ture computed by Kimball for diamond and discussed in connection with 
Figure 5.5. 

This relationship has been investigated by Bardeen 1 for diamond, 
silicon, germanium, and tellurium. The methods used to determine how 
&g depends on volume are based on the statistical relationship between &o 
and the product np of electron and hole densities given in equation (19) 
of Section 10.3: 

np = N c N v e^-^ )lkT 

= 4(2irmkT/h 2 ) 3 (m n m p /m 2 )%- SolkT 

= 2.33 X l0 31 T 3 (m n m p /m 2 )Ke- &alkT cnT 6 . (3) 

The most direct way to study &g is simply to change the volume at constant 
temperature by applied pressure and to study resulting changes in np. 

4 Planned for publication in Bell Syst. Tech. /., 29 (4) (1950). 

1 J. Bardeen and W. Shockley, Phys. Rev. 77, 407-408 (1950). This section gives in ab- 
breviated forni the analysis presented in a second paper planned for publication in the fall 
of 1950. 
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The volume also changes due to thermal expansion, and this affords another 
method of studying the variation of &q with volume. These and other 
methods will be discussed. 

The results of these investigations are given in Table 12.2 as the bottom 
row. Values of |d>i„| and \&i P \ are obtained by solving (1), setting 
m n = m p — m; their sum is shown in the row next to bottom. 

It should be mentioned that the signs of the terms in the last row are in 
agreement with the predictions of band theory calculations. According 
to Kimball's calculations for diamond 2 the bands cross at an atomic 
spacing larger than the equilibrium value, so that &g should decrease with 
dilation and &ig should be negative; presumably the same will be true for 
silicon and germanium. Tellurium, on the other hand, is composed of 
long chains of atoms, and the chains are not held very tightly together. 
For this reason a relatively small amount of overlapping may occur at the 
equilibrium lattice constant and the bands may not have crossed, the 
situation being like that shown in Figure 5.4 or that for large values of 
lattice constant in Figure 5.5. 

Table 12.2 Comparison of Si<? Determined by Other Means with 

\&m\ + \&i*\ 

Diamond Silicon Germanium Tellurium 

pc % = 0,(10"* dynes/cm) 2 .... 10.8° 2.0" 1.55* 0.5' 

ix {Electrons) 
"(cmVvolt sec, 295°K) 900" 300" 3500' 530" 

H P (holes) 

(cmVvolt sec, 295°K) >200» 100" HOC 530" 

Mn T^ 45X10 6 15X10 6 180 X10 5 27 X 10 6 

Mp T^ >10X10 6 5X10 6 86X10 5 27X10 5 

|S lc |(ev) 8.8 6.5 1.7 2.4 

|6to|(ev) <30 11.3 2.4 2.4 

|S ta |+|6i,| <39 17.8 4.1 4.8 

5ig(ev) ? -30* -5* +5.0' 

"Longitudinal velocity in [110] direction calculated from elastic constants. See R. F. S. 
Hearmon, Rev. Mod. Phys. 18, 409-440 (1946); (19S0). 

s Calculated from elastic constants; W. L. Bond, W. P. Mason, H. J. McSkimin, K. M. 
Olsen, and G. K. Teal, Phys. Rev. 78, 176 (1950). 

" Estimated from compressibility. 

d C. C. Klick and R. J. Maurer, Phys. Rev. 76, 179 (1949). 

e G. L. Pearson and J. Bardeen, Phys. Rev. 75, 1777-1778 (1949). 

r Unpublished data of J. R. Haynes. 

' Vivian A. Johnson, Phys. Rev. 74, 1255 (1948). 

* K. G. McKay, personal communication. 

* See Text. 



2 G. E. Kimball, /. Chem. Phys., 3, 560-564 (1937). 
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The value of Sie may be obtained by the following procedures: 

(1) Intrinsic Conductivity versus Pressure. In the intrinsic temperature 
range, n and p are each equal to 

m = 4.82 X 10 18 T H (»i n « 1 ,/» 2 ) ,< e- So/2 * 7, cm- 8 . (4) 

Since the mobility varies as T~^, the explicit dependence of conductivity 
arises solely from the exponential factor and 

<n = <r i0 e- &Gl2kT . (5) 

Using Bridgman's data on tellurium, 3 Bardeen 4 concludes that changes in 
cr,o with volume are unimportant compared to changes produced in S>g- 
By a similar procedure Miller and Taylor find S 1G ~ —5.0 electron volts 
for germanium. 5 

(2) p-n Junction Resistance versus Pressure. According to the theory of 
Section 12.5, the current in a p-n junction is 

J = q \~DvPn + Dnpnl [Qqv , kT _ {] (6) 

L L'j, L. n J 

For temperatures near room temperature in germanium, p v and n n are 
nearly constant, each being equal to the appropriate difference between 
donor and acceptor densities. Consequently, since pn is given by equa- 
tion (3), both n p a.ndp„ must vary as T 3 exp ( — & G /kT). If it is assumed 
that the variations of D p , D n , L p , and L n with pressure are small compared 
to the effect of variations in (b G , then equation (6) can be used to interpret 
the effect of pressure upon resistance. The value obtained in this way for 
germanium 6 is in agreement with the value quoted under (1). 

(3) Density versus Temperature. As discussed later in Section 16.4, it 
is found that the constant coefficient in the product np is found experi- 
mentally for silicon to be 32.5 times larger than the value of equation (3). 
Pearson and Bardeen have proposed that this is due to the temperature 
dependence of Sg, which they write as 

MT) = MO) - PT. (7) 

This equation leads to 

e -&o(.T)ikT _ e 0/fc e -s G (o)/fcr (g) 

so that a multiplying factor exp (fi/k) occurs in np. Setting this factor 
equal to 32.5 gives 

= * In 32.5 = 3 X 10~ 4 electron volt/°K (9) 

3 P. W. Bridgman, Proc. Am. Acad. Sci. 68, 95-123 (1933); 72, 157-205 (1938); 74, 
21-51 (1940). 

4 J. Bardeen, Phys. Rev. 75, 1777-1778 (1949). 

6 P. H. Miller and Julius Taylor, Phys. Rev. 76, 179A (1949). 

6 Personal communication from J. Bardeen based on unpublished experiments of H. H. 
Hall and G. L. Pearson. 
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and combining this with the volume coefficient of expansion a v gives 

g ic = /3/a„ = 3 X l(T 4 /9 X 10~ 6 = 33 electron volts. (10) 

For germanium the corresponding factor in np is 3.5 leading to (3 = 
1.0 X 10 -4 electron volt/°K and since a v = 19 X 10~ 6 /°K, the value of 
S 1G is 5.25 electron volts, in good agreement with the values obtained 
by the other methods. 

It also possible to obtain information on the temperature dependence 
of &g from measurements of the dependence of photoelectric effects upon 
wave length as a function of temperature. Such effects have been observed 
by F. S. Goucher and H. B. Briggs. 7 

12.9 DRIFT AND HALL MOBILITIES 

During the course of the preparation of this book, great improvements 
were made in the techniques for studying the behaviors of holes and 
electrons in germanium. By pulsing the sweeping field in experiments like 
those described in Section 3.1, J. R. Haynes was able to lower the duty 
cycle and thus raise the permissible sweeping fields by a large factor. 1 In 
this way, the wave shapes of pulses of injected carriers were greatly sharp- 
ened compared to those shown in Figure 3.9. As a result this technique 
became a precision method which permitted measurement of the drift 
mobilities with an error probably less than 5 per cent. The values obtained 
by this improved technique are shown in Table 12.3. 

An extended series of Hall effect measurements have been carried out by 
G. L. Pearson. He finds that the Hall mobility varies more from sample 
to sample than does the drift mobility. This variation is not fully under- 
stood. Pearson's values obtained using the same samples employed by 
J. R. Haynes together with the spread in value for a number of samples are 
also shown in Table 12.3. As discussed in Section 8.9 and Section 11.4, 
equations (25) and (28c), if the mean free path is independent of the 
velocity of the electrons or holes, as is the case for the deformation potential 
theory of scattering by thermal agitation, then the drift mobility should be 
8/3x times the Hall mobility. As is seen from Table 12.3, the drift mobility 
values do not fall within the same range as those obtained from the Hall 
mobility, and the discrepancy is apparently larger than the probable error. 
We shall discuss a possible explanation for this discrepancy below. la 

All of the measurements just described were carried out using single 
crystals of germanium, mostly those grown by drawing single crystals from 
the melt by the method described by Teal and Little. lb The importance 

7 Unpublished investigations. 

1 Planned for publication in the Physical Review for late 1950. 

la This analysis was presented in brief by G. L. Pearson, J. R. Haynes and W. Shockley, 
Phys. Rev. 78, 295-296 (1950). 

lb G. K. Teal and J. B. Little, Bulletin of the 298th Meeting of the American Physical Society. 
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Table 12.3 Mobilities in Cm 2 /Volt Sec in Germanium 



(8/3ir) times' 1 Drift* Conductivity' 

Hall Mobility Mobility Mobility 

Electrons 2600 ± 300 3600 ± 180 3350 ± 400 

Holes 1700+500 1700±90 

- 100 



° G. L. Pearson, Phys. Rev. 76, 179 (1949). 

6 Reported by J. R. Haynes at the Chicago Meeting of the American Physical Society, 
Nov. 26, 1949. G. L. Pearson, J. R. Haynes, and W. Shockley, Phys. Rev. 78, 295 (1950). 
c G. L. Pearson, J. D. Struthers, and H. C. Theuerer, Phys. Rev. 75, 344 (1949). 



of using single crystals arises from the added resistance of grain boundaries, 
a subject to which we return in the next section. 

An estimate of electron mobility can be obtained from the experiments 
of Pearson, Struthers, and Theuerer, mentioned briefly in Chapter 1, 
dealing with the determination of antimony content in «-type germanium 
by the use of radioactive antimony. These data show that the conduc- 
tivity varies linearly with the antimony content so that the conductivity 
can be written as 

<f = Nd qvn + &0 

where a is the conductivity of a control sample without added antimony, 
and N/ is the antimony content determined by radioactivity. By solving 
this equation for /x K for a number of samples, the value of conductivity 
mobility of Table 12.3 is obtained. There is a considerable spread in the 
data due in part, at least, to the fact that polycrystalline samples were used. 
The conductivity mobility is in much better agreement with the drift 
mobility values of Haynes than with the Hall mobility values of Pearson. 
This agreement is, of course, the result to be expected because, in principle, 
both the drift experiment and the conductivity experiment measure the 
average speed with which the carriers travel down the filament under the 
influence of an applied field. So long as trapping and possibly electron 
hole collisions do not cause the drift mobility to deviate appreciably from 
the microscopic mobility (see Section 8.7), the two measurements should be 
concerned with the same physical quantity. It should, of course, be 
stressed that from the operational viewpoint, discussed in Section 14.1, the 
two measurements are quite distinct and the fact that they agree increases 
the reliance to be placed upon the additional physical concepts associated 
with the theory of holes and electrons. 

Hall mobility values for germanium have been found to fluctuate 
markedly from sample to sample. Figure 12.9 shows a plot of the values 
obtained in various ways over a period of time. Values differing appre- 
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ciably from those quoted in Table 12.3 have recently been reported by 
Dunlap 2 who gives values for (8/3t)hh as high as 3400 cm 2 /volt sec for 
electrons and 2650 cm 2 /volt sec for holes. 

We shall next present a possible interpretation of the apparent fact that 
for w-type germanium the Hall mobility ns n , which we shall take as 
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Fig. 12-9 — Historical Data on Mobility in Germanium. (Since some of the data 
represent unpublished "best values" current at the time, no references are given.) 

(37r/8) 2600 = 3100/cm 2 volt sec, is less than the drift mobility n n , taken 
as 3600 cm 2 /volt sec measured for electrons injected into p-type germanium. 
From equation (25) of Section 11.4 we find that 

(weighted average of t 2 ) 






(weighted average of r) 



> 1. 



2 — 



For the case of a constant mean free path and non-degenerate statistics, 
the value of mh/m is 3ir/8 = 1.18, whereas the ratio 3100/3600 = 0.86 is 
smaller by a factor of 0.73. Thus the experimental values cannot be 
reconciled with theory by any assumed dependence of r(v) upon the 
velocity v of the electrons. It is possible, however, to have values of 



fiH 



< 1 for re-entrant energy surfaces 



in the Brillouin zone in place of the spherical energy surfaces with constant 
effective mass on which the formulae of Section 11.4 were based. 



2 W. C. Dunlap, Jr., Phys. Rev. 77, 7S9A (1950). 
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In Figure 12.10, we show how energy surfaces of a peculiar shape can give 
rise to a great decrease in /Wm- This figure represents the P z = plane 
in the Brillouin zone. For this case the velocity vector for a contour of 
constant energy will vary in magnitude, being greatest where the contour 
comes close to the origin in P-space. If an electric field is applied in the 
^-direction, electrons for the »-type sample considered will be accelerated 
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Fig. 12-10— Re-entrant Energy Contours which Will Lead to Low Values of /iff lv~ 

to the left in Figure 12.10 and will alter the number of electrons in states 
between the energy surfaces. The most important regions will be those 
for which the component of velocity v x is large since f ', the fraction of the 
quantum states occupied by electrons, increases as 

(i f ) ==£m-i)v x E x 

\dt /due to E x ki 

as shown in Section 11.4, equation (7). Two groups of quantum states 
with large negative values of v x are represented as Ssi and 8s 2 in Figure 
12.10. For these, E x produces a rapid increase in the number of electrons. 
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If a magnetic field is applied, a flow of points along the energy surfaces 
will take place in the direction shown by the arrows, and the extra electron 
in Ssi and 8s 2 will be displaced towards 8s i and Ss 2 '. This displacement 
will result in a net increase of the +_y-component of average velocity since 
both 8si and 8s 2 have more positive v y values than 8si and 8s 2 . Thus 
the result 3 is to rotate the average velocity vector in the clockwise direction 
rather than the counterclockwise direction, as would be the case for 
spherical energy surfaces, which would have circular intersections in 
Figure 12.10. Hence the sign of the Hall effect for the case of a few excess 
electrons at lowest energy states of the conduction, band can be that nor- 
mally associated with holes. 

It should be stressed that the electrons near the bottom of a conduc- 
tion band like that of Figure 12.10 are not equivalent to holes. If one 
extra electron were present in the band of Figure 12.10, it would constitute 
a net negative charge and it would move towards the positive terminal in a 
drift experiment. In these respects it would act like the excess electrons 
considered in earlier chapters. Its drift velocity, however, would be 
deflected by a magnetic field so as to give the Hall effect of a hole. Simi- 
larly holes in an almost filled band with surfaces like Figure 12.10 would act 
like holes except that they would give the Hall effect of electrons. We 
shall introduce the terms anomalous electron and anomalous hole to describe 
the behaviors associated with these new concepts. Anomalous holes will 
give the Hall effect expected for electrons, usually called normal Hall 
effect, whereas anomalous electrons will give the Hall effect associated 
with positively charged carriers, usually called anomalous Hall effect. In 
a filament containing both anomalous holes and anomalous electrons, the 
carriers would be concentrated on the opposite side of the filament from 
that shown in Figure 3.10. This would lead to an anomalous Suhl effect, 
and data on the impedance of point contacts, like that of Figure 3.12, 
would be reversed regarding the sign of the magnetic field. 

For a less drastic case than that shown in Figure 12.10, a reduction 
rather than a reversal of sign will occur for mj?/m- This is the proposed 
explanation for the observed ratio of about 0.86. 

That a reversal of sign due to re-entrant surfaces does not occur in 
germanium is shown directly by the sign of the Suhl effect and by the 
mutual consistency of Hall effect, direction of motion of injected carriers, 
sign of the field effect, dependence of conductivity type upon valence of 
impurity, direction of rectification, and thermoelectric effects. That it is 
not unreasonable to expect modifications of the sort described previously 

3 The formula for Hall effect for complex surfaces is given in A. H. Wilson, Theory of 
Metals, Cambridge at the University Press, 1936, equation 236. The geometrical interpreta- 
tion of Figure 12.10 has not in the author's knowledge been previously published. 
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which might lead to a value 0.86 for mh/m can be seen by considering a case 
in which the energy contours have been drawn in three dimensions, Figure 
12.11. Although the surfaces shown are computed for another crystal, 
there is no reason to suppose that similar surfaces do not occur for ger- 
manium. 4 

Further evidence that the energy surfaces in the Brillouin zone are not 
spherical is obtained from magnetoresistance effects. It is found that much 




Fig. 12-11 — Energy Surfaces Near Edge of an Energy Band. [For sodium chloride 
after Shockley, Phys. Rev. 50, 754 (1936).] 

larger effects are obtained by applying magnetic fields than can be accounted 
for by the theory of spherical energy surfaces presented in Section 11.4. 
Recent results of magnetoresistance experiments have been reported by 
Pearson 5 and Suhl. 6 They find for single crystals of germanium that the 
magnetoresistance is highly anisotropic which is further evidence for the 
lack of spherical energy surfaces. 

If the energy surfaces are not spherical, the deformation potential 
theory of the mean free path will need major revisions since scattering by 
shear modes can then be important. It is reasonable to suppose that 
scattering by shear waves will probably be less important than by the 
dilatation waves considered in Chapters 11 and 17 so that the agreement 
between S 1G and |Si„j + |S 1p | of Section 12.8 will probably still be valid. 

4 W. Shockley, Phys. Rev. 78, 173 (1950). 

5 G. L. Pearson, Phys. Reo. 78, 646 (1950). 

6 H. Suhl, Phys. Rev. 78, 646 (1950). 

7 W. Shockley, Phys. Rev.79, 191 (1950). 
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12.10 ON A THEORY OF NOISE 

Considerable progress has been made in understanding the nature of 
noise in germanium. It has been found by H. C. Montgomery 1 that 
noise having the typical (1/frequency) or contact-noise spectrum 2 de- 
scribed in Section 2.2 occurs in germanium filaments. 

These experiments employ specimens having ends so large that any 



CONSTANT CURRENT SOURCE 




PROBES 1 



Fig. 12-12 — Montgomery's Noise Transmission Effect. 



effects due to metal semiconductor contacts are eliminated. Figure 12.12 
represents one of the various types used in these experiments. Shapes 
like that shown, and others of similar complexity for other purposes, have 
been produced with a magnetostrictively driven cutter designed by W. L. 
Bond, 3 some examples being shown in the frontispiece. Under typical 
experimental conditions the filament is supplied by a constant-current 
source so that fluctuations in voltage between any two points of the fila- 
ment may be attributed to a fluctuating resistance. In order to measure 
the noise voltage, arms are cut out of the same single crystal as shown in 
Figure 12.12 and are used as probes. This method eliminates the insta- 
bility and additional noise which often occurs in connection with metal 
semiconductor contacts. The (noise voltage) 2 is found, to a first approxi- 
mation, to consist of a thermal or Johnson noise component plus a term 

1 H. C. Montgomery and W. Shockley, Phys. Rev. 78, 646 (19S0). 

2 The term contact-noise arises from the fact that noise of this type was first encountered in 
connection with contacts: "Resistance Fluctuations in Carbon Microphones and Other 
Granular Resistances", C. J. Christensen and G. L. Pearson, Bell Syst. Tech. J. 15, 197-223 
(1936). 

3 W. L. Bond, Phys. Rev. 78, 646 (19S0). 
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which is directly proportional to I 2 and thus corresponds to a fluctuating 
resistance. Montgomery has obtained strong evidence that for w-type 
germanium this resistance modulation noise is due to a fluctuating hole 
current. The preliminary theory of his effect is as follows: 

In the body of the germanium or on its surface there are regions which 
tend to have higher hole density than other regions. The presence of such 
regions has been observed in unpublished experiments of Haynes which 
show that the pulsed application of a sweeping field sometimes causes 
pulses of holes to pass by the collector point although no emitter contact 
is used at all. Haynes has been able to locate the sources of holes as being 
in certain definite regions of the filament which are not visibly different 
from the surrounding material. It is not known in all cases whether these 
regions are actually .p-type inclusions or simply regions which are less 
w-type than the adjoining parts of the crystal. We shall for the present 
purpose assume that the regions are simply less »-type and thus under 
equilibrium conditions have a density of holes higher than the average. 
In the presence of an electric field, these higher densities are swept away. 
As a result, the regions, depleted of their normal hole density, attempt to 
re-establish the density by the emission of holes which are also swept away 
so that by this mechanism the region acts as a source of holes. If the hole 
current from this source were purely random in character, the noise in the 
filament would be much smaller than that observed and would have a 
"white" frequency spectrum (noise power per cycle independent of fre- 
quency) rather than the 1// spectrum actually observed in the filaments 
as well as in transistors, as discussed in Section 2.2. Accepting for the 
moment the idea, which we shall discuss further below, that such fluctuat- 
ing sources of hole current exist in the filament, let us examine the conse- 
quences of this picture. 

The noise produced by a given fluctuating source of holes will be pro- 
portional to the lifetime of the holes in the filament, in fact the noise power 
will be directly proportional to the square of this lifetime. As we have 
explained earlier in connection with the Suhl effect, the lifetime of holes 
in the filament may be materially decreased by the simultaneous applica- 
tion of magnetic fields and electric fields. Reduction of noise in the fila- 
ment has actually been observed by Montgomery and is in fair quanti- 
tative agreement with the theory based on these ideas. From his experi- 
ments one finds that the data can best be fitted by assuming that the noise 
originates in sources of holes located on the surfaces of the filament. This 
result is consistent with the finding that the surface is the principal locus 
of recombination for holes with electrons. In fact under conditions of 
thermal equilibrium, the principle of detailed balancing requires that any 
region which acts as a locus of recombination also acts as a locus of emission 
to an exactly balancing degree. There is a somewhat similar symmetry 
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between emission and recombination under nonequilibrium conditions; 
thus a region having a fluctuating ability to effect the recombination of 
holes and electrons can introduce noise into an otherwise uniform flow of 
holes in just the same way as can a region of fluctuating emission. 

In addition to the effect of noise reduction by applied magnetic fields, 
Montgomery has observed a noise transmission effect in using a structure, 
like that of Figure 12.12, having three side arms so that three noise volt- 
ages, V\2, V 2 z, and V\z, may be measured by connecting various pairs of 
terminals to the noise-measuring set. If the resistance modulation in 
each of the two segments were independent of the other, then the noise 
powers V\2 and V^z should add to give the noise power F 13 2 . Under 
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Fig. 12-13 — Modulation of the Recombination-emission Process bya Trapped Charge. 



the experimental conditions, however, the lifetime of a hole in the fila- 
ment is so long that most of the holes entering Segment 1 also pass through 
Segment 2 before recombining. Furthermore, the transit time is a small 
fraction of a cycle of the measuring frequency. Consequently, the modu- 
lation due to the hole current should be approximately the same in both 
segments and the noise voltages, rather than the noise powers, should 
add. Confirmation of this prediction is furnished by the fact that Mont- 
gomery finds 



^13 2 = ^12 2 + ^23 2 + 2CV l2 V, 



12^23 



(1) 



with a value of 0.85 for the correlation coefficient C. If the surface is 
treated so that the lifetime is less than the transit time between segments, 
then the correlation between resistance modulation in one segment and 
the other is largely eliminated. Combined with the experiment with 
magnetic fields these experiments with the noise transmission effect con- 
stitute very strong evidence that the noise in these filaments is due to a 
fluctuating hole density. 

In Figure 12.13, we show a possible mechanism which will give rise to 
noise of the sort discussed. We represent a region of the semiconductor 
containing two imperfections having the hypothetical properties described 
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below. One of these is a recombination-emission center, referred to as the 
center. It is supposed that this center has an energy level to which an 
electron may be excited thermally from the valence band. After an elec- 
tron is so excited, as shown in (a), the hole left behind moves off so that 
in this way the center has produced a hole. The electron may then be 
further excited to the conduction band, after which it moves away leaving 
the center in a neutral condition. This process may be repeated so that 
the center can act indefinitely as a source of holes and electrons. Under 
equilibrium conditions, the reverse process, in which an electron falls into 
the center from the conduction band and later from there into a hole which 
happens to diffuse into its neighborhood, will occur with equal frequency. 
If the region shown in (a) and (b) is one which normally has a relatively 
high hole concentration and if the holes are swept out by an applied electric 
field, then the action of the traps can be shown to be such that the region 
acts as a natural source of holes. 

In Figure 12.13 we show a different type of imperfection which can also 
trap an electron. We shall suppose that this trap is much more stable 
so that the electron remains in it for a time very much longer than that 
required for the center to go through a cycle of the sort shown in (a) and 
(b). We shall refer to this trap as a modulating trap or simply a trap} 

Now the ability of the center to absorb or emit hole-electron pairs can 
be modulated by the charge of the trap. If the modulating trap shown 
m the figure becomes charged by trapping an electron, this will alter the 
shape of the potential energy bands as shown in (c) and (d). As a result 
of this change, it is somewhat harder for the hole to escape and somewhat 
easier for the electron to escape. In general, these two effects will not 
cancel and one process will be modulated in a more important way than 
the other. Consequently, the source strength of the centers in the neigh- 
borhood of the modulating trap will fluctuate as the trap gains or loses an 
electron. If the energy levels of the traps are distributed over a con- 
siderable energy range, then the frequency spectrum characterizing their 
charge and discharge by capturing and emitting electrons will be of the 
1// type. 5 

The case represented in Figure 12.13 is only one of a number of similar 
possibilities. Furthermore, Montgomery's experiments with magnetic 
fields indicate that the noise sources are on the surface of the filament. 
For this case the centers may well be surface states and the traps may also 
be surface states of a different sort or, alternatively, the modulating role 
of the traps may be played by ions or polar molecules moving on the 
surface. 

4 Traps play a major role in the theory of phosphors. For discussion and references see 
H. W. Leverenz, Luminescence of Solids, John Wiley and Sons, Inc., 1950. 
5 1 am indebted to J. Bardeen for calling my attention to this conclusion. 
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It should be stressed that the sources of noise discussed here, like the 
other imperfections mentioned earlier in this section, are apparently not 
necessary attributes of n- and />-type germanium. In the idealized model of 
n- and />-type germanium, consisting of a perfect lattice except for sub- 
stituted atoms of valences three and five, there appears to be no important 
source for noise of the type actually observed. It is to be hoped that with 
improvement in the art, real germanium specimens will be produced so 
close to the idealized model that for them the noise will be reduced towards 
the minima set by Johnson noise and space charge limited shot noise. 

12.11 CONCLUDING COMMENTS 

In this section we shall review briefly the status of transistor electronics 
and shall indicate areas in which it is expected that scientific and technical 
progress may occur in the future. 

As mentioned many times in the text, the invention of the transistor by 
Bardeen and Brattain has opened up a new area of research in semicon- 
ductor phenomena. The experiments of Haynes on drift mobility, of 
Pearson on conductivity modulation, of Suhl on magnetic concentration 
of holes and electrons constitute, the author believes, foundation stones 
in the entire theory of electronic conductivity in semiconductors. These 
experiments show that many aspects of the conceptual pictures of the 
behavior of electrons and holes have a real operational meaning, quite 
apart from that previously explored by measurements of conductivity, 
Hall effect, and thermoelectric effects. 

In transistor electronics the germanium filament (see Frontispiece) 
plays the role of the vacuum or gas discharge tube of conventional elec- 
tronics. The analogue of the thermionic cathode is the emitter contact 
junction, and its efficiency is the fraction y of its forward current carried 
by injected carriers. In Chapter 3 data were quoted for several cases in 
which y was measured. Since the time at which those studies were made, 
the estimate of b has been revised upwards from 1.5 to 2.1, germanium 
side arms have been used as probes, and a-c techniques for injection and 
measurement have been employed. W. H. Brattain, who introduced the 
last two improvements in the studies of y, has made measurements on an 
w-type germanium. He finds values of y varying from as low as 0.2 to 
nearly unity with a tendency for y to increase for a given point with 
increasing emitter current. The value of y is dependent upon surface 
treatment but not in a readily reproducible way. The apparent value of 7, 
which is measured as the ratio of small a-c values i P /i ( , increases with 
increasing sweeping field and Brattain's values are extrapolated to infinite 
field. 

In addition to the measurements of 7 and the new observations of 
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fundamental phenomena made possible by filament techniques, other de- 
tailed information has been acquired concerning the behavior of holes and 
electrons. It has been shown that usually recombination takes place 
chiefly on the surface of the filament. It is also known that a hole in gen- 
eral will have many thousands or tens of thousands of collisions with the 
surface before it is finally lost. It appears that this surface recombination 
process is structure-sensitive to the chemistry and potential fields at the 
surface. 

It is also known that other impurities and imperfections in the crystal 
besides simply elements of columns 3 and 5 of the periodic table, such as 
were discussed in connection with Table 1.1, are involved in the recombina- 
tion process in the body of the germanium. It is found that this body 
recombination is relatively insensitive to the presence of ordinary donors 
and acceptors but can be markedly increased by the same sort of heat 
treatment which tends to convert «-type germanium to p-type. 1 It has 
also been found that the p-type centers produced by bombardment with 
nuclear particles are very effective in aiding the recombination of holes 
and electrons. 

It is to be hoped that the nature of these imperfections will be estab- 
lished with the aid of low temperature studies, lifetime measurements, 
and studies of noise and of trapping phenomena. 

Grain boundaries in germanium introduce electrical discontinuities, as 
mentioned in Section 12.9. Their principal features have been reported 
by Pearson. 2 He finds that some grain boundaries act like p-type regions 
so that the electrical characteristics are like those of two p-n junctions in 
series; this picture is further supported by the photovoltaic effects, which 
are also consistent with this view. It is not certain whether the p-region 
at the boundary is due to segregation of chemical impurities or to disorder 
of a type similar to that produced by nucleon bombardment, but arising 
from atomic misfit between crystals on opposite sides of the grain boundary. 
If the germanium is converted to p-type by heat treatment, the grain 
boundary apparently acts simply like a very thin layer of somewhat differ- 
ent conductivity and thus has no appreciable effect upon the resistance of 
the specimen. 

Great progress has been made in the germanium material itself under 
the pressure of demands of the transistor program. As a result, relatively 
large crystals of high perfection and chemical purity are now available, 
and experiments in transistor electronics which would have been impos- 

1 Unpublished investigations carried out by M. Sparks and J. R. Haynes. 

2 G. L. Pearson, Phys. Rev. 76, 459 (1949). See also N. H. Odell and H. Y. Fan, and 
W. E./Taylor and H. Y. Fan, Phys. Rev. 78, 334-335 (1950). The theory of the structure 
of grain boundaries for some of the simpler crystals appears to be in accord with experiment. 
See W. T. Read and W. Shockley, Phys. Rev. 78, 275-289 (1950). References. 
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sible in the past can now be performed. A preliminary report of crystal 
growing has been made by G. K. Teal and J. B. Little. 3 



At this point we shall briefly summarize the aspects of the energy band 
theory of 'conduction which have been tested in connection with the 
transistor program. According to the energy band picture an excess 
electron 

(1) has a charge of — e 

(2) undergoes random diffusive motions 

(3) drifts under an electric field with a characteristic mobility 

(4) is deflected sidewise by a magnetic field. 

A hole has similar attributes except for the sign of its charge. The new 
experiments have put all of these aspects on a firm operational footing. 
The sign of the charge and the drift velocity have been verified by carrier 
injection experiments. The magnitude of the charge has been confirmed 
by the rectification formula for p-n junctions which depends critically upon 
e/kT. The sidewise deflection has been directly observed in the Suhl 
effect experiments. The diffusive motion has been verified by experiments 
on p-n junctions in which the frequency characteristic is found to be in 
accord with the diffusion theory; it has been still more directly verified in 
an experiment carried out by Goucher 4 in which carrier injection by light 
absorption is followed not by drift in a sweeping field but by diffusion in a 
negligible electric field. 

These attributes are the principal consequences of the physical picture 
of holes and electrons discussed in Part I. In a quantitative form this 
picture furnishes the mathematical discipline required for the theoretical 
side of transistor electronics. It is, therefore, of importance to appreciate 
that as a result of the new experiments growing out of the transistor 
program, the important features of the physical picture are now based on direct 
experimental evidence and need no longer be considered to be deductions 
from an involved quantum mechanical analysis. 

Further evidence for the validity of the band theory is furnished by the 
correlation, discussed in Section 12.8, of band spacing or energy gap with 
size of the unit cell. An additional piece of evidence is the fact that 
approximately unit quantum efficiency, that is one hole-electron pair per 
photon, is obtained in photoconductivity experiments 5 ; this result is just 
what should be expected on the basis of the physical picture of excitation 
by photons described in Chapter 1. 

3 G. K. Teal and J. B. Little, Phys. Rev. 78, 647 (1950). 

4 F. S. Goucher, planned for publication in the Phys. Rev. in 1950. 
6 F. S. Goucher, Phys. Rev. 78, 476, 816-817 (1950). 
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It may be appropriate to speculate at this point about the future of 
transistor electronics. Those who have worked intensively in the field 
share the author's feeling of great optimism regarding the ultimate poten- 
tialities. It appears to most of the workers that an area has been opened 
up comparable to the entire area of vacuum and gas discharge electronics. 
Already several transistor structures have been developed and many 
others have been explored to the extent of demonstrating their ultimate 
practicality, and still other ideas have been produced which have yet to 
be subjected to adequate experimental tests. It seems likely that many 
inventions unforeseen at present will be made based on the principles of 
carrier injection, the field effect, the Suhl effect, and the properties of 
rectifying junctions. It is quite probable that other new physical prin- 
ciples will also be utilized to practical ends as the art develops. 

The preparation of this book was motivated by the conviction that 
electronic conduction in semi-conductors was about to take its place as a 
subject of major importance in the communications art and other branches 
of electrical engineering as well. Workers in transistor electronics had 
expressed a need for a new treatment of the physical principles in descrip- 
tive terms with special emphasis upon the features likely to be used as the 
basis for engineering design. Part II has had as its chief aim the supplying 
of this description and some of the mathematical tools useful in design 
considerations. A re-examination of the foundations of the basic theory 
constitutes one of the major aims of Part III, the other aims being con- 
cerned with specially adapted presentations of certain well-known branches 
of theoretical physics. The more fundamental investigations, because of 
their strongly mathematical flavor, may be inaccessible to most readers. 
It is hoped, however, that they will convey the important impression, 
which the author believes to be true, that the substratum of basic theory 
is sound for transistor electronics. 

Problems 

1. Show that if at t = there is a distribution of holes on the plane at 
x = with N v holes per unit area, then owing to diffusion (neglecting all 
other processes) the value of p at later times is 

Consider the three-dimensional case and show that if N p holes are con- 
centrated at r = at t = 0, then the density at time / is 

D( r a = Np e -ryiBt 

PKr ' t] (4tD/)» 

(This may most easily be done by testing the solution in the continuity 
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equation using rectangular coordinates. See problems of Chapter 11 for 
the integrals.) 

2. Show that, for the one-dimensional diffusion of Problem 1, the value 
of (x 2 ), denned as 

(x 2 ) = f " x 2 p{x,t)dx/N p 

is 

(x 2 ) = 2Dt. 

Verify that for any function p(x, t) satisfying 

dt dx 2 

and appropriate boundary conditions 

4 (x 2 ) = 2D. 
dt 

Extend these results to two and three dimensions. 

3. Show that if a plane of holes as in Problem 1 is generated (by the 
photoconductive effect for example) at / = in a medium of hole lifetime 
t p , the solution of the continuity equation for the extra holes is 

p(x,t)=-^=e-*' /iDt - t/ ". 

Show that the relative number of holes which recombine in time dt is 
[exp (—t/Tp)]dt/T p . Show that the value of x 2 averaged for each hole 
from generation to recombination is 

(x 2 ) = 2Dt p . 

Obtain this same result by averaging x 2 over the d-c part of the solution 
(16) of Section 12.5. Show by a physical argument that the two results 
should be the same. Show that the average distance, not (distance) 2 , 
diffused by a hole is 

J-tp == V UTp 

for both solutions. 

4. Consider a pulse of holes formed at the point r = at ( = in a 
field E and then allowed to spread and drift owing to diffusion in the field. 
Show that the distribution at t is like that of Problem 1 but centered at 
r = tfipE and that the ratio of distance spread sidewise to drift distance is 
approximately 

sidewise spread _ J 2kT 
drift distance V eV 
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where F = E ■ (drift distance) is the difference in voltage between the 
initial position and the position at time /. This formula is useful for 
estimating how far down filament holes must flow before filling the cross 
section and for other similar applications. 

5. Compare the relative importance of the conduction and diffusion 
terms for hole flow in the «-region as mentioned in connection with equa- 
tion 20 of Section 12.5. To do this consider a plane, near the transition 
region, at which we may consider I p and /„ to be approximately equal. 
Show that this leads to 

dx n n qL p 

and hence that diffusion currents are the dominant process for hole flow 
unless Vq is so large that the injected hole density is comparable to n n . 
Show that for holes in germanium with y. p = 1700 cm 2 /volt sec kT/qL p 
is equal to 3.90 (10 6 r p )^ volts/cm or 3.90 volts/cm for r v = lju sec. 

6. Consider surface recombination for the case of s —>0. Show that 
the formula (19b) of Section 12.6 



&+-a 



.B + 

can be obtained by calculating the rate of surface recombination assuming 
that p = const, across the filament and evaluating (total rate of recom- 
bination) -5- (total number of holes present). 

7. Consider the neglected solution of equation (21) of Section 12.6 
which was said to represent diffusion against the field. Show that, if 
recombination is neglected, this solution reduces to the Boltzmann con- 
centration factor discussed in Section 12.3, equation (2). 

' 8. Show that in considering the Suhl effect of Section 12.6 the continuity 
equation (30) can be separated if a factor cos bz is included, the effect being 
to add a term D p (po 2 /C 2 to v in equation (31), the boundary conditions for 
<Po being the same as for H z = 0. 

9. Calculate the transverse field E t and transverse voltage for a filament 
20 mils square (that is, 0.05 cm on each edge) with E x = 10 volts/cm and 
H z = 5000 oersteds for p- and »-type germanium. 

10. Assume an effective mass equal to the electron mass and show that 
the number of electrons having velocities in a range dv x dv y dv z and lying in a 
volume of space dV\s according to approximation (2) of Section 10.1 

dn = V*kT/m)» e ^ w+ ^ H . 
n 

Show that the electrons with velocity components v x v y v z striking an element 
of area dxdy in time dt come from a volume dxdyv z dt. Integrate this over 
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all velocity classes and obtain for the rate at which electrons strike dxdy 
the formula 

' hT , — 

= nv T / Vk = 0.282»p r 

where v T is the velocity used in Section 11.4, equation (20), and is related 
to T as for Figure 10.2. 

11. Apply the result of the previous problem to show that the number 
of collisions with the surface before recombination occurs is given by 

Number of collisions per recombination = 0.282»rA 

where s is the surface recombination velocity of Section 12.6. Show that 
this formula will be valid for values of s < 10 5 cm/sec. Show that for 
s = 10 3 cm/sec there are ~2800 per recombination. 

12. Show by a geometrical argument that for the construction of Figure 
12.8 the total number of holes is independent of time. 

13. Derive the partial differential relationship 

/dp\ = _ /a*\ //dx\ 

\dtj x \dt/p/ \dpJt 

which must hold whenever x is a function of p and /. Apply this to the 
relationship 

x(p, t) = x(p, 0) + £ V[I{t'), p]dt' 

and verify that this satisfies equation (6) of Section 12.7 for the case for 
which the current is /(/). 

14. Derive the equation 

d_E = EdI_ y(E) 9E 
dt I dt dx 

where according to equation (9) of Section 12.7 

E = I/c{p) 
and consequently 

Show that if E is plotted as a function of x a pulse with an infinitely sharp 
leading edge at t = has a parabolic leading edge for / > 0. 

15. Consider as an approximation that the trailing edge of the shock 
wave corresponds to the solution for x < of Problem 1, 

p{x,t) = p (4TrDt)-*e-* /iDt . 



Chap. 12] PROBLEMS 353 

Show that this tends to spread to the rear at a rate of approximately 

^$ = VWt = 2D/(x). 
at 

Equate this to the rate at which the concentration p = tends to catch 
up with p = p max and obtain the formula for the width of the trailing 
edge quoted at the end of Section 12.7. 

16. Show that a solution to the equations of section 12.7 can be ob- 
tained for / = constant and constant lifetime r p by moving each point 
p(x Q , 0) of the initial distribution, like that of Figure 12.8, along a curve 
defined by ; such that 

p{xo,t) =p(x ,0)e- t/T » 



x = x + f F[I,p(x ,t'))dt'. 
Jo 



17. Internal Contact Potentials. 1 Consider a filament of w-type ger- 
manium with s = (no surface recombination). Suppose that a contact 
is made to this with a probe of «-type germanium but that the two are 
separated by a thin layer of p-type germanium so that the probe is sep- 
arated from the main body by two p-n junctions. Suppose that holes are 
injected into the filament so that <p p > <p„. Then by applying reasoning 
like that of the first pages of Section 12.5, show that the probe tends to 
acquire a potential between <p p and <p„. In particular, if <p' is the Fermi 
level deep in the probe where <p p ' — <p n show that <p n < <p < <P P - Show 
that if the p-layer is converted to rc-type, by imagining that Nd is increased 
for example, then <p — > <p n . 

18. The treatment of the p-n-p transistor 1 constitutes a further applica- 
tion of the method of this section. 

19. The collector currents used in the experiments on which Figure 3.6 
is based are sufficiently large so that diffusion currents are negligible com- 
pared with conduction currents. Show that the ratio, I n /I P , of the elec- 
tron current to the hole current flowing to the contact is then equal to 
bn/p. Assuming b = 2, and using the data of Figure 3.6 calculate the 
"intrinsic" a for each of the four contacts. (This method is due to 
Bardeen.) 

1 See W. Shockley, Bell Syst. Tech. J. 28, 435-489 (1949) for a further discussion. 
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CHAPTER 13 
INTRODUCTION TO PART III 

The topics dealt with in this book are in principle, although not in 
actuality, directly derivable from the basic laws of quantum mechanics. 
If complete calculations were available, a strictly logical development 
would start by stating these basic laws, probably with accompanying 
references to their historical origin and experimental verification in simple 
cases. For the case of germanium, the laws would then be applied to ele- 
ment number 32. From this starting point the lowest energy state for a 
large number, say 10 22 , of germanium atoms would be determined. This 
lowest state would correspond to a crystal having the diamond structure 
and, by comparing its energy with the energy of isolated neutral atoms, the 
strength of the Ge-Ge covalent bond would be found. 

Next all the excited states would be calculated and classified in terms of 
electron motions and atomic vibrations. 

The effect of temperature upon the system would then be obtained by 
applying the principles of quantum statistics to the excited states. Com- 
bining these results with an investigation of the effects of applied electric 
and magnetic fields would lead to a theory of the conductivity and Hall 
effect of a-pure or intrinsic germanium sample. If we had started with an 
admixture of 10 15 atoms of number 51 (antimony), the results would 
apply to an «-type sample. If, in addition, we considered cases in which 
the impurities were spatially distributed in special ways, this fundamental 
approach would lead to interpretations including the rectification of 
p-n junctions and the amplification of p-n junction transistors. 

A procedure of the sort just described is neither practical nor, for that 
matter, desirable. Calculations of the lowest and some of the excited 
states have been made for diamond and for silicon. The numerical re- 
sults, however, are not in very gratifying accord with experiment and could 
not very satisfactorily be used for design purposes. The important con- 
tribution which the theory makes is to provide a conceptual framework 
on which to hang experimental facts so as to give them a more fundamental 
interpretation and to explain and predict relationships among them. For 
this purpose an exact theory, which specified the correct quantum states 
or eigenfunctions for the entire system, would be, owing to the com- 
plexity of its results, far inferior to an approximate theory, which uses 
much simpler pictures and takes into account their inexactness in terms of 

357 



358 INTRODUCTION TO PART III 

various corrections which can also be simply understood. It is chiefly to 
show how these approximate solutions arise from fundamental quantum 
mechanics and how they are interpreted that this part of the book is 
written. 

In the following chapters we shall deal with almost all the problems 
involved in the fundamental or first-principles treatment. The emphasis 
and level of presentation, however, differ greatly from one topic to another. 

The treatment of quantum mechanics in Chapter 14 is at a very practical 
level and will probably be of value chiefly to readers with engineering 
training. Unlike most treatments, the emphasis is laid from the outset 
upon the application of quantum mechanics as a tool for calculating re- 
sults. This emphasis is particularly appropriate since in the field of 
transistor electronics, the design of useful devices may be considered to 
be based on the quantum physics of semiconductors. 

In addition to presenting the fundamental laws and formulae of quantum 
mechanics, Chapter 14 contains applications to some simple problems and 
compares their mathematical forms with the mathematical forms of cor- 
responding circuit theory analogues. The Bloch wave functions for a 
general form of a one-dimensional crystal are derived. The chapter closes 
with an indication of how the method is extended to a general three- 
dimensional crystal. The problem of constructing the Brillouin zones for 
general crystals is somewhat lengthy and does not bring out any particu- 
larly important new physical ideas. Furthermore, extensive treatments 
of the topic are found in several texts. For these reasons, no illustrative 
examples of Brillouin zones for the diamond structure or other crystal 
structures are presented. The reader interested in additional details will 
find a list of references at the end of Chapter 14. 

The kinematical and dynamical properties of electrons and holes, which 
were discussed in Chapters 6 and 7 on a descriptive level, are derived as the 
consequences of quantum mechanics in Chapter IS. The first three sec- 
tions of that chapter are concerned with deriving the group velocity formula 
and employ relatively simple mathematical techniques. The later sections 
are on a more advanced level. Two methods of dealing with holes and 
electrons are used. The first of these, given in Section 15.6, deals with 
a wave-packet and shows that the packet simulates the behavior of a 
particle obeying the laws discussed in Chapter 7. The following two sec- 
tions show how wave-packets for holes are constructed and interpreted, 
the results derived being in agreement with the treatment of Chapter 7. 
In Section 15.9 the other method of dealing with the acceleration of an 
electron is presented. In this latter case true periodic boundary condi- 
tions, along one direction at least, are introduced by imagining that a thin 
crystal is bent into a loop. Acceleration of the electron around the loop 
is produced by employing the loop as a short-circuited secondary of a 
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transformer. This case has the conceptual advantage of representing a 
situation in which the system can be discussed, in principle, in funda- 
mental quart tum-mechanical terms while steady currents are flowing around 
the loop. This model thus represents a step along the fundamental path 
in which the behavior of the semiconductor would be deduced directly 
from quantum mechanics applied to a particular system. The treatment of 
the Hall effect on the basis of this model is omitted since it is treated for 
the wave-packet case of Section 15.6. 

The effects of thermal agitation, used in Chapter 7, are discussed more 
fully in Chapter 16. Statistical mechanics in general is the subject of 
many texts, and the treatment here is not intended to be exhaustive. 
Simple derivations of the Fermi-Dirac distribution for electrons and for 
the energy distribution of harmonic oscillators, the Planck distribution, 
are presented. This treatment is based on the use of approximate rather 
than exact quantum states and furnishes an example of the principle stated 
earlier that exact solutions may not always be necessary or even desirable. 
This treatment is followed by a detailed application of the statistics to the 
semiconductor problems of particular interest in this text. 

Chapter 17 deals with the interaction of holes and electrons with vibrat- 
ing atoms in the crystal. The quantum-mechanical theory of transition 
probabilities is applied to this situation and it is found that the mechanism 
of scattering by thermal agitation is closely related to the dependence upon 
lattice constant of the energy gap between conduction and valence bands. 
Section 17.7, the closing section, extends the results derived for one electron 
to the case of a hole and to the intermediate case of many holes and many 
electrons. 

As indicated by the title of this part of the book, the analysis presented 
here furnishes a foundation upon which the descriptive theory of Part II 
is based. The most extensive use of this foundation is required for the 
mathematical theory of conductivity and Hall effect presented in Chapter 
11. It may be helpful to point out here how the chapters of Part III 
contribute to Chapter 11. In Chapter 11, the electrons (or holes) are 
considered initially to be distributed in the Fermi-Dirac distribution; 
we may imagine for this purpose that the model of the crystal bent into a 
secondary transformer turn is used. Under the influence of an electric 
field, the distribution is disturbed, each electron being accelerated as dis- 
cussed in Section 15.9. As a result, the principle of detailed balancing 
(Section 17.2) no longer applies and equilibrium tends to be re-established. 
The details of the way in which equilibrium is re-established constitutes 
the principal problem of Chapter 11. 

The foundation which supports Chapter 11 is not quite complete. In 
Chapter 15, the effect of fields in the absence of thermal agitation is pre- 
sented while in Chapter 17 the effect of thermal agitation in the absence 
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of fields is presented. In the application in Chapter 11, it is assumed that 
these effects can be directly added. The additivity can be proved by 
extending the theory of transitions of Section 17.2 to the case in which 
the electron is being accelerated. The results for an electric field are 
somewhat different in form from the Weisskopf-Wigner treatment; the 
physical content, however, is equivalent to adding the two effects just as 
is done in Chapter 11. Thus it is possible to furnish this missing portion 
of the foundation structure. The treatment, however, would add little 
which was new from a conceptual point of view and is omitted in the inter- 
est of brevity. 

Similar objections may be raised to the use of the scattering formulae 
of Chapter 17 to wave-packets being accelerated as discussed in Section 
15.6. Again the answer is that it is quite possible to show that the'scatter- 
ing results of Chapter 17 could be extended to cover an accelerated wave- 
packet and thus to prove the desired additivity result. However, the 
added complexity and length of exposition would probably be of very 
limited interest to most readers; and interested readers, on the other hand, 
would be more benefited by carrying out the process themselves. 



CHAPTER 14 

ELEMENTARY QUANTUM MECHANICS WITH 
CIRCUIT THEORY ANALOGUES 

14.1 ON THE NATURE OF AN APPLIED 
MATHEMATICAL THEORY 1 

Quantum mechanics is recognized as one of the more difficult branches 
of theoretical physics, chiefly because it employs certain abstract mathe- 
matical tools which in themselves have no direct physical interpretation. 
Thus one is told, for example, that applying an operator to a wave func- 
tion in the proper way will lead to predicting the outcome of an experi- 
ment. Neither the operator, the method of applying it, nor the wave 
function has a direct physical interpretation. This lack of a straight- 
forward interpretation is a definite disadvantage from a pedagogical view- 
point — it makes the theory harder to present — but it does not indicate any 
lack of validity of the theory in its applications. In so far as its applica- 
tions are concerned, a mathematical theory of an experimental science 
must satisfy one supreme requirement: It must agree with experiment. 
Although this requirement can be simply stated, we shall give several ex- 
amples to illustrate its meaning before discussing the agreement with 
experiment achieved by quantum-mechanical theory and the procedures 
used to test it. 

We shall employ the phrase applied mathematical theory to indicate the 
procedure which the theory specifies in order to predict relationships be- 
tween measured quantities. We are not here concerned with the modus 
operandi of the theoretician, or with how theory may be used to plan experi- 
ments and, thereby, to improve theory, although we shall comment briefly 
on such subjects at the end of this section. We are here concerned with 
the formalism of a well-developed applied mathematical theory, such as the 
quantum-mechanical theory of electrons in solids, and how it is related to 
experiment. The general nature of the connection between the, formalism 
of the theory and the application of the theory is indicated in Figure 14.1. 
We shall explain the various features by means of some examples. 

1 For a more thorough discussion of the subject matter of this section, see in particular 
P. W. Bridgman, The Logic of Modern Physics, Macmillan, 1931, and The Nature of Physical 
Theory, Princeton University Press, 1936, and R. B. Lindsay and H. Margenau, Foundations 
of Physics, John Wiley & Sons, 1936. The discussion in this section is somewhat similar to the 
chapter "Mathematics in Application" of The Nature of Physical Theory. 
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As the figure indicates, the application concerns measured quantities 
which are denned by physical operations, such as, for example, the volt- 
ages, currents, lengths and times, with which we are chiefly concerned in 
connection with the properties of semiconductors. The first step in the 
application of a physical theory is always the same and involves a change 
of viewpoint. Instead of thinking of the position of a pointer on the scale 
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Fig. 14-1— The Nature of an Applied Mathematical Theory. 



of an ammeter, one thinks of the symbol / which the theory uses to repre- 
sent any number which may be obtained as a result of reading the ammeter. 
This procedure is carried out in accordance with the rules of correlation 
between the "operationally denned" physical quantities and the mathe- 
matical symbols. 

For the case of a very simple theory, such as the application of Ohm's 
law to a cylindrical wire, the rules of correlation would be 

voltmeter reading — * V 

ammeter reading — > I 

length in centimeters — » L 

diameter in centimeters —* D 

resistivity in ohm-cm — * p 
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In this case there would be the additional concept of an area of cross section 
which would be given the symbol A. This listing would complete the 
first step and bring us to the second box, algebraic symbols, in the 
figure. 

The second step would require the insertion of these symbols in the 
algebraic equations 

i" = AV/pL for Ohm's law (1) 

and 

A = ttD 2 /4 for area (2) 

followed by the use of all the rules of algebraic manipulation, from which 
one might obtain the new relationship among the symbols 

L = xDT/W, 

an example of the contents of the third box in application of the theory. 

According to the rules of correlation, we could relate these symbols to 
measured quantities, thus taking the third step to the fourth box, and 
could then use this relationship co predict the length of a wire wound on a 
coil. 

Frequently, additional physical concepts are introduced describing quan- 
tities which are not directly measured. Thus, in the theory of conduc- 
tivity and Hall effect, we introduce the ideas of electrons and holes and 
their mobilities and concentrations. In dealing with a sample of ger- 
manium containing radioactive antimony, we may make predictions about 
the transverse voltage observed in a Hall effect experiment. This pre- 
diction will involve a relationship between voltage, current density, mag- 
netic field, linear dimensions and counting rate on a Geiger counter, time 
since the radioactive sample was calibrated, and so forth. However, in 
making this prediction we introduce many symbols, having quantitative 
values and associated with definite conceptual entities (like electrons and 
holes), which are not directly measured. All of these quantities are manip- 
ulated according to formulae discussed in Chapters 5 to 11 before the 
predicted relationships are finally cranked out. 

In quantum-mechanical theory the role of the experimentally measured 
quantities is frequently played not by quantities which are actually meas- 
ured but by ones which could in principle, at least, be measured if enough 
pains were taken. 2 Thus the position of an individual electron is seldom 
measured. The path, however, of a single electron can be observed in a 
cloud chamber, and the time at which it passes through a Geiger counter 
can be measured as well. The position of the electron can thus be meas- 
ured in principle. Also, its speed of motion can be measured by measur- 

2 In Section 14.3, however, we shall discuss a case in which actually measured positions are 
strongly influenced by quantum effects. 
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ing the time in which it travels a given distance. Quantum mechanics is 
concerned with the correlations between such measurable quantities as 
position, velocity, and energy of the electron. 3 In making predictions 
about the relationships of these quantities, the additional physical concept 
of a wave function is added. Manipulations are performed upon this 
wave function in accordance with the rules of manipulation. (In quantum 
mechanics it is hard to specify the difference between a rule of manipula- 
tion and a new physical concept ; for our purpose, it is not worth while to 
attempt it.) As a result of these manipulations with the wave function, 
predictions about the behavior of the electron are made. However, as 
will be discussed in Section 14.3, no direct physical interpretation of the 
wave function itself is given, nor is one required. In Section 14.3 we shall 
write the equations governing the wave function and discuss the rules of 
correlation. We shall precede this by a discussion in Section 14.2 of the 
mathematically similar subject of an electrical transmission line. But 
first we shall briefly discuss some additional aspects of an applied mathe- 
matical theory in general. 

It should be pointed out that the practice of applying a mathematical 
theory to experimental science is not, in the hands of a theorist, simply a 
matter of manipulating the symbols of the theory in a routine way. The 
sets of mental images employed by the theorist, as he carries out the manip- 
ulations, differ greatly from one worker to another and are intellectually 
the most engrossing aspects of theoretical work. These images and 
analogies permit the theorist to summarize the theory in a satisfying and 
convenient way. For example, a physicist with one type of viewpoint 
may prefer to visualize processes in terms of particles moving and collid- 
ing, whereas another may think of algebraic symbols and their patterns 
of relationship. 4 Frequently, with the aid of these mental pictures, the 
theorist can arrive quickly at conclusions which could be reached only 
more laboriously by symbolic manipulation. This procedure has its pit- 
falls, for the processes used may well be forms of reasoning by analogy and 
can easily be carried too far. This use of physical intuition may perhaps 
be called the art of theoretical physics. 

One of the difficulties of mastering an applied mathematical theory is 
closely related to the employment of several sets, rather than one set, 
of mental images: There are, on the one hand, the physical quantities 
themselves and, on the other, the symbols which represent them. (These 
latter are in a sense equally real, consisting of marks on paper or sound 

3 Quantum mechanics embraces, of course, a far larger field, but, for our present illustration, 
the treatment of an electron sufHces. 

4 We are here discussing topics connected with the use of "models", the meaning of an 
"explanation", and the requirement of "simplicity" in a theory. For a further discussion of 
these subjects see the references given to the heading of Section 14.1. 
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waves in air.) Manipulation of the symbols alone (at least to the extent 
usually needed for physics) does not pose intellectual problems of the 
same magnitude as those of mathematical physics; neither does experi- 
mentation in a laboratory. The difficulty of mathematical physics arises 
in the translation of the mental images by the rules of correlation. This 
difficulty is first encountered by students in high school algebra in con- 
nection with word problems. These problems are generally recognized 
as being harder than the problems which are stated in algebraic form at 
the outset. Still more difficult are the processes discussed in the previous 
paragraph, in which a mental image of the physical situation is examined 
from changing viewpoints consistent with the changing relationships among 
the symbols during the course of the manipulations. 

14.2 THE MATHEMATICAL THEORY OF A TAPERED 
LOSSLESS TRANSMISSION LINE 

We shall consider here a lossless transmission line with distributed con- 
stants which may vary from one position to another. Applying the 
appropriate rules of correlation, we say that, at time / and position x on 
the line, a voltage V{x, t) and a current I(x, t) may be specified. There 
are, in addition, the inductance per unit length, L(x), and the capacitance 
per unit length, C(x). The mathematical theory requires these six sym- 
bols to satisfy certain differential equations. In the discussion that 
follows, these equations are manipulated so that new and useful relation- 
ships between the measurable quantities, represented by the symbols, 
are derived. For example, we shall show that the expressions for energy 
stored per unit length on the line and power flow are consistent: The loss 
in energy for any segment of the line can be accounted for by outward 
flow of power at the ends. 

Having derived a number of equations, we next ask what sort of a theory 
we would have if it were impossible to measure current and voltage in the 
line, but instead only energy density. If this were the case, then V and I 
would have to be regarded as additional physical concepts introduced in 
the theory. Looked at in this way, the mathematical theory of the trans- 
mission line bears a close relationship to that of quantum mechanics in 
which two functions, like V{x, i) and I(x, t), are introduced to describe the 
behavior of the electron, which itself is only observed in connection with 
its probability density. 

In Table 14.1 are given a set of equations referring to this section and to 
the next. They are written in parallel columns to permit easy comparison. 
In this section we shall derive and discuss the relationships in column (A) 
leaving (B) for the next section. 

If distance is measured along the line from left to right by the variable x, 
with positive current flowing to the right, then equations (\A) apply to 
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the line. We shall use these to establish the consistency of the power flow 
and energy density expression. Thus, suppose we start with the physical 
idea, which may be regarded as an additional physical concept m Figure 14.1, 
that power flow to the right at any point of the line is VI evaluated at that 
point. Then the net power into the line segment between X\ and x 2 (xi<x 2 ) 
is 

P(x u x 2 ) = V(x u t)I(x u t) - V(x 2 , t)I(x 2 , t). (1) 

But the right side is mathematically equal to 

Jf**» d r x */dV dl\ 

f -{VI)dx= - / (-I+V--)dx 
%! dx o 'xi \dx ox) 

= -r(- L d ii-vc d Adx 

Jxi \ dt at) 



vxi at 

- L \ f 

dt Jx\ 



(LI 2 + CV*)dx. (2) 

This proves that the net flow of power into segment (#i, x 2 ) can be ac- 
counted for by the rate of change of the integral of \(LI 2 + CV 2 ) along 
the line. This is valid even when L and C are functions of # (but not of/) 
and is, of course, what we should expect on the basis of our previous knowl- 
edge that f LI 2 and ^CV 2 are energies per unit length. 

If the line extends to x = ± <=o and the current-voltage wave is localized 
(both V and / vanish at x — ± oo ), then the theorem considered above 
shows that the total energy in the line is independent of time and is equal 
to the integral 

,+ " WLI 2 + %CV 2 ]dx = S, (3) 



/.: 



which is constant [since P(xi, x 2 ) vanishes for x\ = — oo , x 2 = + 00 ]. 
The same conclusion is reached if the line has finite length A and is open- 
circuited (/ = 0) at both ends, or short-circuited (V = 0), or closed on 
itself so that power flowing out at x = A is simultaneously flowing in at 
x = 0. 

Equations (1A) may readily be manipulated to give equation (4A) and 
a similar equation for d 2 V/dt 2 . If we suppose further that L may vary 
while C does not, and consider a wave of frequency w so that 

I(x, t) =fi(x) cos oit +f 2 (x) sin wt, (4) 

then I(x, t) and/i and/2 must satisfy equation (5A). This equation is 
formally similar to (5B) which we shall discuss later, 
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Table 14.1 Comparison of Equations for a Lossless Transmission Line 
with the One-Dimensional Wave Equation for a Particle 

Transmission Particle 

Line (h = h/2w) 

(A) (B) 
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Density 
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Energy per unit 
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Probability per unit 
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dx 
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Now suppose that the only measuring instruments we had available were 
some combination of an electrostatic voltmeter and a-c ammeter so that 
we could measure only the energy density (2A). If we were able to set 
up disturbances in the line, or if these were somehow introduced by the 
experimental conditions, we should be able to study a number of the 
features of the line. If our measuring devices could operate on a sum- 
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ciently fine scale, we could shunt the line and measure standing-wave pat- 
terns and in this way learn something about the wave length of the waves. 
Finally, if we were ingenious enough, we might be able to invent a mathe- 
matics which would describe all the properties of the line so far as the meas- 
urements of average energy densities were concerned. Now, if this math- 
ematical invention led us to two equations such as (1A), with further 
consequences such as (2A) and (3A), we could, in the theory, talk about 
I and V without having any physical way of knowing about them per se 
or, for that matter, without ever having had any experience with induct- 
ances or capacitances except as they occurred in inseparable combination 
in the transmission line. This lack of direct experience would not, how- 
ever, hinder us in the application of the theory to predict results of power 
reflection, standing wave ratios, and so forth. Some additional physical 
concepts about V and / would enter the theory, but would go out again at 
the point where the rules of correlation were applied (in the third step of 
Figure 14.1) to predict relationships among observed quantities. 

As we shall now discuss, the roles of the real and imaginary parts of 
Schroedinger's wave function are analogous to / and Vol our hypothetical 
example. However, there is an important difference: Quantum-mechani- 
cal theory implies strongly that there is no underlying substratum of 
experience in terms of which \|/ can be interpreted in a manner similar to 
that in which / and V can be. 

14.3 THE MATHEMATICAL PHYSICS OF 
SCHROEDINGER'S EQUATION 

In Table 14.1 in column B we give a number of equations related to the 
one-dimensional Schroedinger equation. The particle in question moves 
along the #-axis under the influence of forces which give it a potential 
energy 6 U(^). The two wave functions u(x, t) and w(x,t) occurring in 
equations (IB) are required to be real just as / and V in (1A) must be real. 

The quantity u 2 + w 2 , which is thus real and positive, is called a proba- 
bility density. The rule of correlation for its interpretation is as follows: 
Suppose we have an experiment in which we detect the position of an 
electron at some instant /. For example, we find where the electron strikes 
and thus exposes the photographic emulsion of a plate in an electron 
diffraction camera. 1 In this case, the position of the electron in the 

1 The modern electron diffraction camera, which is the descendant of the early electron 
diffraction apparatus with which Davisson and Germer first observed that electrons were 
diffracted like waves, is built like a cathode ray tube. The electron beam (perhaps 10 -8 amp 
at 40 KV) strikes the sample, is diffracted and strikes the photographic plate, which usually 
replaces the fluorescent screen. 
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emulsion does not change with / for large /'s. We repeat this experiment 
over and over with many electrons, each of which is started down the 
camera in the identical way, and presumably with the same initial values 
of u and w (which in this case will be functions of y and z as well as of x 
and /). The first rule of correlation is: 

(RC.l) 

probability that electron be found between x\ and x% 



= r* (u 2 + w 2 )dx / f " (u 2 + vr 2 )dx. 



(1) 



That is, at any instant /, the value of u 2 + w 2 is proportional to the 
probability per unit length that the electron is at a given place. In the 
repetitions of the experiment, this probability gives the fraction of the 
electrons found in each interval. The theory can thus be compared with 
experiment. When this is done, it is found that the darkening on the 
plate is precisely as predicted from the theory. Furthermore, the pre- 
dictions are far from trivial: From the nature of the interference with the 
crystal, the wave length of the electron waves can be determined just as 
accurately as the wave length of the electromagnetic waves of the last 
section could be found. Furthermore, from the values of the potential 
energy function, G U, in the crystal, intensities of the waves can be cal- 
culated. In all of these respects, the mathematical theory represented by 
equations (IB) of Table 14.1 and rule of correlation (RC. 1) appears to be 
in agreement with experiment. 

In order to simplify the probability calculations, U and w are usually 
multiplied by a scale factor (normalizing factor) so that u 2 + w 2 is nor- 
malized: 

+ " (u 2 + vr 2 )dx = 1. (2) 



f. 



Then the denominator can be left out of (RC. 1). In general we shall use 
normalized wave functions. 

As shown in Table 14.1, a density flow (3B) can be found for this case 
analogous to the power or energy flow for (3A). If we consider a length 
of line between # x and x 2 , then the rate of change of probability that the 
electron be in this interval is 



d_ 
dt 



J [u 2 (*, t) + w 2 (*, t)]dx = 2j x (u — + w — j dx. (3) 
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Substituting (IB) for the time derivatives and integrating by parts gives 

h P*i ( d 2 n d 2 w\ , 

= - / I w —j - u —y )dx 
m Jx x \ dx dx / 

h / du dw\K h P^/d-wdu du dw\ 

= — I w U — I / ( )d>< 

m\ dx dx/W mJxi \dx dx dx dx / 

h ( dw du\ h ( dw du\ 

= — I u w — 1 lu w — ) > (4) 

m \ dx dx/ Xl m \ dx dx) Xi 

which shows that the change in probability between x\ and x 2 can be 
accounted for by a flow (h/m) (udw/dx — wdu/dx) to the right as shown 
in (3B). This result is derived for three dimensions in Section 15.4. It 
is given here to illustrate the procedure required in later proofs in this 
section. 

In addition to explaining electron diffraction results, predictions based 
on the same interpretation of u 2 + w 2 , when used for electrons in atoms, 
give rise to probability densities like those shown for H, H 2 , C, Si, and Ge 
in Chapter 1, and these probability densities lead also to predictions of 
experimental results, somewhat less directly, by means of other methods 
of correlation. 

In Table 14.1, we have deliberately written Schroedinger's equation in 
an inconvenient and unfamiliar form. The purpose has been to show that 
there is no greater cause for mental turmoil about the fact that the wave 
function 

i\r(x, t) = u(x, t) + iw(x, t) (5) 

has real and imaginary components than about the fact that energy density 
on the transmission line can exist in two forms. The quantities u and w 
must be real numbers. They are called the real and imaginary com- 
ponents of the wave function, \|/. \|/ is also called the "probability am- 
plitude", to distinguish from "intensity" which is proportional to (ampli- 
tude) 2 and which occurs in expressions for density and density flow, 
especially in theories of mechanical and electrical systems. For our one- 
dimensional case, u and w have dimensions of length - ** (i.e. cm - ^) — cm - ^ 
in three dimensions — since when squared and integrated over length (cm) 
they give unity, a pure number with no dimensions. In the rules of 
correlation, u and w never occur singly but only in products, like u 2 or 
udvr/dx. For this reason there is no direct physical significance to be 
attached to u or w or \|( individually. Although called amplitudes, there 
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is no physical entity the displacement of which they represent. 2 In other 
words, t|r is something put on paper by a theoretical physicist and manipu- 
lated according to certain rules as aid to predicting and interpreting 
experimental data [the most important rule of correlation being (RC. 3), 
given on p. 375]. However, as a result of experience and practice, the 
manipulator becomes so facile in applying the rules of correlation that it 
may appear to the onlooker that the wave functions themselves are con- 
sidered to be the ultimate reality — an erroneous impression which is 
frequently refuted in quantum-mechanical texts. 

The reader may well ask: If the wave function is so ephemeral a thing, 
why do more than thirty illustrations of it appear in this text? The 
answer is that as an aid to prediction of electronic behavior it is at present 
unexcelled, and that, with familiarity in its use, one gains ability usefully 
to predict, or at least to interpret, observable results. 

The reason that \|f is written in complex notation with an imaginary 
component is simply a matter of convenience. The rules of correlation 
employed in combining the two real numbers u and w can be expressed 
compactly and conveniently by use of the notation of complex numbers, 
with the formal rule that, wherever i 2 occurs, it may be replaced by —1. 
Thus, for example, the complex conjugate of \|/ is v)»*: 

if -if = u + *w then by definition i|r* = u — ?w, (6) 

and 

\|/* -if = (u - ;'w) (u + m) = u 2 - z' 2 w 2 = u 2 + w 2 (7) 

and by definition the absolute value of \J/ is 

|^| = ty'ty)* = Vu 2 + w 2 , |x|/| 2 = ^*i|r. (8) 

It is an obvious result that i|r*v|/ is always real and positive unless \|/ = 0. 3 
Proceeding with the use of complex number notation, we convert equa- 
tions (IB) of Table 14.1 to the customary form as follows: We multiply 
the first of equations (IB) by ih and the second by — h and add, obtaining 
in this way 

(from the left side) 

du dw d d\[f 

ih h — = ih — (u + m) = ih— = (9a) 

dt dt dt K ' dt 

(from the right side) 

2 It appears to be as fruitless to be concerned about what + is a displacement of as it has 
been to invent models of the ether in which electromagnetic fields constitute some form of 
mechanical stress. 

3 Another example in which a complex notation makes for easier manipulation of real 
quantities is given in the footnote of Section 7.7 and in Section 12.3. 
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so that 

ift^ = DC^, (9c) 

at 

where, by definition, 3C is the linear operation in the square brackets. [As 
we shall see later the interpretation of (h/i)d/dx as momentum indicates 
that the + sign in ± is the more appropriate.] We may now state: 

The time dependent Schroedinger equation for a particle moving in one 
dimension in a potential field ^(x) is 

dti h 2 d 2 

where m is the mass of the particle and h is Dirac's notation for Planck's 
constant h divided by 2ir. 

For the pedagogical purposes previously mentioned, we introduced this 
equation by the less direct procedure of Table 14.1. 

So far we have discussed only one rule of correlation, namely, (RC. 1), 
equation (1), Section 14.3, relating to the probability density u 2 4- w 2 = 
|v|f| 2 . From this we derived the density flow formula (3B) of Table 14.1. 
We shall next draw further conclusions from (RC. 1) and equation (10) 
in preparation for a statement of the general rule of correlation (RC. 3), 
equation (19) of Section 14.3, which is one of the fundamental postulates 
of quantum theory. In these derivations we shall suppose, for the sake 
of discussion, that \J/(«, t) has the form of a pulse or wave-packet, although 
the formulae apply equally well in more general cases. 4 We shall calculate 
the average position of the electron in the wave-packet, the mean square 
deviation of its position, and, finally, the average velocity of motion of the 
wave-packet. The manipulations are given in some detail so that the 
nature of the averaging processes will be obvious. 

We suppose, therefore, that there is an experimental arrangement, like 
that discussed in connection with the electron diffraction camera, with 
the aid of which we can study the statistical behavior of many (say N) 
electrons, all of which have behaviors so much alike that they may be 
represented by the same wave function v|f (#, /) = u(x, i) + ivf(x, t), where 
t is measured from the start of the experiment. (If the experiment were 
repeated at successive times for the difFerent electrons, then the t's would 
be measured from difFerent zeros.) Now suppose that at time /, for each 
electron, we make an observation of the electron's position. Then by 
(RC. 1), assuming \J/ normalized in accordance with equation (2), we find 
a fraction \^(x, t)\ 2 dx of them with positions between x and x + dx, so 
that, on the average, we obtain values of x between x and x -\- dx just 

4 The mathematics of such wave functions is considered in Sections 15.5 and 15.6, a par- 
ticular wave-packet being shown in Figure 6.2. 
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iV|\|f(.tf, t)\ 2 dx times. Consequently, the set of all the measured values 
*ij x 2> • • • > X N will give an average value of 

. 2v/ ,+ " *!+(*> <)l 2 <& 

- _ (*i + *2 + • • • + *jv) _ «/-« 
*~ iV ~ N 

= f "x|^(*,/)| 2 i*. (11) 

%J — 00 

This quantity is, by definition of an average, the average value at time t 
after each experiment started of the position of the electrons in the N 
repeated experiments. If our experiment is set up in such a way as to give 
us a well-defined pulse of probability density, then the ^-values will cluster 
about the mean value x, with a certain spread about this value. It is 
instructive and useful for later applications to define a mean value of this 
spread. This may be done as follows: From each value of* in the list of 
N individual observations, 5 subtract x so as to obtain the deviations 
Xi — x, X2 — x, etc.; square these deviations and average them. 6 By 
definition of the averaging process, this average, called the mean square 
deviation, is denoted by (x — x) 2 and is given by 

*\2 l . . . I /-„.. _ = \2 



(*i - xy + (x 2 — xy + • • • + {x N — x) 



(x - x? = ^ '—^-^ ~- '-^ '- • (12) 

This can be written in the form 



(x — x) 2 = (x 2 — 2xx + x 2 ) = x 2 — Ixx + x 2 

= x 2 -x 2 . (13) 

Now, by the same reasoning as that used for the specification of x, we 
conclude that x 2 is given by 

-5 *i 2 + *2 2 + • • • + x N 2 r+» 



N 



f ° x%(x,h)\ 2 dx. (14) 



The principal reason for carrying out these manipulations in detail at this 
point is to illustrate the use of the probability density |^| 2 in computing 
averages of x and functions of x (that is, the simple function x 2 ). We 
shall have occasion to discuss other mean squared values in a later section. 
It is evident from equations (12) and (13) that, only if |\|/(x, t)\ 2 con- 

6 In principle we did the experiment N times and recorded the result N times although, in 
practice, the averaging of the x's may be accomplished by the measuring instrument itself 
which might, for example, display the total charge induced on a capacitor plate by the N 
electrons. 

6 The average of the deviations themselves is zero, since the plus and minus deviations 
cancel out. 
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centrates all its probability at one particular value of x (so that every 
x — x = 0), does x 2 = x 2 . Otherwise x 2 always exceeds x 2 . 

We shall next make use of the value of x to describe one way in which an 
average velocity may be measured in principle. The value of x, which 
we now denote by x(t), depends upon / through the time dependence of 
i\i(x, t). The mean position of the electron thus changes at the rate 

dx{i) d r+ a 14/ N , . 

— - I xy*(x, t)\\r(x, t)dx 



dt dt% 



*+ + -+*f]* 



-Xt"[> 

/+" r du dw~] 

.. n^+^r* (15) 

where the last form can be derived directly from the u 2 + w 2 expression. 
The time-derivative terms can be removed from the u, w form with the aid 
of equations (IB) of Table 14.1 or from the v|f form with the aid of equation 
(10), and the resulting equations integrated by parts (provided \}r vanishes 
at ±°o, as it must in accordance with our packet assumption), the 
manipulations finally yielding the result 



«/-» m i ox 

= - / I"— -w — )dx. (16) 7 

»2«/-» \ dx dx/ 



The last two expressions are equal because 

+» 



X 



u — dx = kvi 
dx 2 



= (17) 



and a similar result holds for w. Equation (16), written in the \}r form, 
is an example of one of the general rules of correlation which we shall now 
express: The symbol 

(RC2) p-t± (18) 

is called an operator since py\i performs the operation 7 of differentiating vj;. 
The operator p is correlated with the classical momentum of the particle 
and when properly averaged (as described below) gives the mean value of 

7 It may be noted that, whereas x = j x\ty\ 2 ,dx can equally well be written as / ty*xtydx 
in accordance with (16); p = / i|»*p+^v is not equal to / p |i|/|V#. 
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mdx/dt. There are other operators which are likewise correlated with 
attributes of the electron's behavior which are, at least in principle, 
measurable. For all of these operators we have: 

(RC. 3) the average value: Select the operator, 9L, correspond- 
ing to the quantity of interest, operate upon the wave func- 
tion "if with it, multiply by \|/* and integrate. The result is 
the expected average value which would be obtained from a 
series of measurements of the quantity made in experiments 
in each of which the behavior of the electron is described by 
the wave function v|/: 

<&) = fy*9tydx. (19) 

We have seen two examples of the use of such operators: The position of 
the particle has the operator x; and the momentum p, or mdx/dt, has the 
operator (h/i)d/dx. The averages 8 are computed by 

<*> = f " ty*xtydt (20) 

ty*p\\idx = ty*-.— tydx (21) 

oo U — tx> % dx 

The average value of p, it may be remarked, is, at least in some experi- 
ments, a directly measurable quantity. Thus, in velocity-modulated 
tubes, pulses of many electrons flow down the drift-space and give detect- 
able signals as they cross the output circuit. From the solutions of the 
time-dependent Schroedinger equation, it is possible to make predictions 
of the average value of dx/dt for an electron which moves in a potential 
field 6 lt. It can be shown, by methods similar to those discussed in Chapter 
7, that such an electron wave-packet will move with precisely the velocity 
expected for an electron on the basis of Newton's laws of motion. If this 
same electron beam strikes a crystal at a point along its path, diffraction 
effects in keeping with its wave length will occur. 

Let us next consider the average values and operators for potential 
and kinetic energy for an electron. Since, when the electron is at x, it 
has potential energy %.(«), the average potential energy is, by reasoning 
similar to that used to evaluate (x 2 ), 

(%)=/* °° SUV + vr 2 )dx = f " etl|i|r|V* = f " ty^dx. (22) 

t/ — 00 t/ — 00 V — oo 

8 We shall use interchangeably (x) and x to denote averages, depending upon readability 
and economy of type setting, in this section. 
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We shall next show that the logical operator to use for the kinetic energy is 

' = 2m = 2m \7 die) \i die) ~ ~ 2m dx 2 
which gives 

The reason for this choice is that we can show that the sum (K.E.) + < G U> 
remains constant in time [that is, prove that d[(K.E.) + {^i))/dt = 0] no 
matter what solution we have for equations (IB) of Table 14.1. This 
result can be established by taking the time derivatives under the integral 
signs, substituting from equations (IB), and integrating by parts. (We 
shall do this in the next section, making use of the manipulation procedures 
for operators.) The result is that 

j f ((K.E.) + <%» = 0. (25) 



This result can be expressed more simply by noting that the operator for 
total energy is simply the operator DC 

2m 2mdx 2 



K.E. + eu = ^ + GU=- :r - 7 - 5 + % s 3C (26) 



so that (K.E.) + <6U) = <3C> and d(VC)/dt = 0. 

The presentation given so far has been intended to exhibit quantum 
mechanics as an applied mathematical theory which is used to make pre- 
dictions about the behavior of electrons. The exposition has stressed the 
similarity to the transmission line problem in order to show that it is not 
necessary to have a further interpretation of \[f = u + zw in order to carry 
out calculations according to rules of manipulation and make predictions 
on the basis of rules of correlation. The order of presentation has been 
quite different from that used in most texts on quantum mechanics since 
it has been assumed that many readers will have some intuitive feeling 
about transmission lines and thus could be helped by considering the 
analogy between energy density and probability density. The pair of real 
functions U and w were introduced in place of the mathematically equiva- 
lent complex function \|; for the same reason. On the basis of the relatively 
simple interpretation of \|r*\|f as a probability density, the average of x was 
introduced, which led, for example, to the interpretation of the operator p 
as a means of computing x. The formal procedure of quantum mechanics 
introduces these rules of correlation compactly as postulates from the 
outset and deals conveniently with the integrations by parts by means 
of the symbolism of Hermitian operators. 
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We shall introduce this symbolism briefly in the next section, and use it 
later to help establish the idea of energy eigenfunctions. 

14.4 OPERATOR METHODS 

Of fundamental importance in quantum mechanics is the fact that the 
important operators are Hermitian. An operator 91 is Hermitian if, for 
any two wave functions <|> and \|/, the following equation is satisfied: 

f tf&fydx = f (9L+)*ydx. (1) 

The result depends upon using wave functions which behave properly a 
the limits of integration, which satisfy the proper boundary conditions. 
We shall consider two cases: (I) a and b are finite and \|/ must vanish at 
both limits. 1 This corresponds, as we shall show in Section 14.6, to 
infinite potential energy outside the interval a — b so that electrons are 
reflected at the boundaries. (II) a = and b = A represent the same 
point of a cyclic or periodic model (see Figures 5.6 and 5.9), so that \|/ is 
the same at x = a and x = b. In the remainder of Chapter 14 it will be 
assumed that the individual wave functions, or sets of wave functions, 
consistently satisfy one or the other of these two boundary conditions. 

That the following operators are Hermitian is evident, almost by in- 
pection: 

91 = a positive or negative real number, like h/m. 
(Although the operation is trivial, calling a 
number an operator is required by the general 
symbolic procedure.) (2) 

91 = x, x 2 or any real function of x, like 6 U(^). (3) 

The result is less evident for 

p = (h/i)d/dx (4) 

K.E. = {\m)p 2 = (h 2 /2m)d 2 /dx 2 . (5) 

We shall prove the Hermitian property of p by direct manipulation, and the 
Hermitian property of K.E. from that of p. By substituting p for 91 in (1) 
and integrating by parts, we obtain 

l^dx=-.j a ^ldx 

/ I a I Ja dx 

1 The case of « = — <x y i = + x can be considered as a limiting case of (I). 
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since the integrated term vanishes for boundary conditions (I) and (II). 
Before treating K.E., we note that if 91 and § are Hermitian and r and s 
any two real numbers, then 

r9\. + j§ is Hermitian. (7) 

The proof is simple. Also, if 91 is Hermitian so is 9i 2 ; since we can write 
Xx = Sty and X2 = Styj we have 

f <k*9i 2 \\tdx = f $*9lxidx = f (&$)*xidx = etc. 

%Ja %Ja Ua 

= f (& 2 <t>)=V*. (8) 

Ja 

Applying (8) top 2 and (7) to (^m)p 2 , we conclude that K.E. is Hermitian. 
By applying (7) to K.E. and %, we conclude that 3C is Hermitian. 

The procedure of integration by parts used with p can be used to estab- 
lish the instructive theorem that the average value of the K.E. is never 
negative: If the state of behavior of the electron is described by a wave 
function y\r, the value of (K.E.) can be transformed as follows: 

2m Ja T * r 2m Ja \i dxj \i dxj 

h 2 r b \W\ 2 

= ^r br dx - & 

2m Ja J dx\ 
As we shall show next, it is possible to find an operator § whose average 
value, computed by the average value rule, gives the time derivative of the 
average value of another operator 91. To rephrase this symbolically, 2 

(§) s /V§*K* = T <&> = T fv^dx. (10) 

J at at J 

We shall derive the following equation for §, to which we give the symbol 
d9L/dt, 

§ = ^ = 1 (90C - DC91). (II) 3 

at tn 

We shall first illustrate the meaning of the symbolic operator form on the 
right for the case 91 = x; to do this, we let § operate on any \|/ and simplify 
the resulting expressions: 

Svf, = 1 (ffOC - 3C&)f 

-k'(-«+«)*-(-££+*)-4 ™ 

2 We shall in future equations omit writing the limits of integration; it is to be understood 
that the integration extends over the range specified by the boundary conditions. 

3 The reader may verify that if A and B are Hermitian, i(AB — BA) is also. 
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since 

a 

dx 2 ' 



we obtain 



x t-& + L( x °$)_ 2 & + it % (13) 

dx dx \ dx/ dx dx 

in L 2m d#J mt dx m 



the result previously obtained in equations (16), (18) and (RC. 3) of 
Section 14.3, by evaluating x. To prove (11) in general, we write: 

«->-/[¥«♦ + ♦«?]* 

in J 
= j /*v|f*(-X£R. + 9l3C)i|ftk (IS) 

which is equivalent to equations (10) and (11). 

From this result it easily follows that / \|/*if^/.v is independent of time 

as it must be for |t|/| 2 <& to represent a probability. 

The form of (11) and the results just considered for x show that two 
operators do not always commute; by definition: 

If StS = S$, then 91 and § commute. (16) 

The failure of quantum-mechanical operators to commute is what causes 
quantum mechanics to lead to results different from those of classical 
mechanics. 

If the operator 91 in equation (11) is tJC itself, then the rate of change 
of (3C) has the operator 

=7^ = - (floe - dc&) = - (to: - :oc) = o (17) 

at in in 

so that the average value of jfC remains constant. This is a quantum- 
mechanical expression of the principle of conservation of energy. In 
general any operator which commutes with 3C has a constant average 
value. 

In Chapter 5, great emphasis is placed upon the eigenfunctions of 
Schroedinger's equation. These satisfy the equation 

3ty = gf, (18) 

which we shall discuss in the next section. In operator language, the 
energy 5 and wave function \|/ which satisfy this equation and the boundary 
conditions are said to be an eigenvalue and an eigenfunction, respectively, 
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of the operator 'DC. All the operators of quantum mechanics have eigen- 
functions and eigenvalues. For example, the operator p has eigenfunc- 
tions for the case of cyclic boundary conditions [that is, ty(x + A) = ty{x)] 
which satisfy the equation 

pty, = Pstys (19) 

where *|/ s equals exp (Irisx/A) where s is any integer and the number p s 
equals 2irhs/A = hs/A, as may be verified by substitution in equation 
(19) using (RC. 2), equation (18) of Section 14.3, for p. 

The formulae which give the relationships between operators and the 
results of their operations upon various wave functions form a major part 
of quantum theory. The brief discussion which has been presented of this 
topic has been intended chiefly to show that DC and p are Hermitian and to 
derive the operator for d9l/dt. 

We have developed a description of the operators of quantum mechanics 
in terms of the manipulations used in dealing with wave functions. In 
closing this section, we shall point out that a very different procedure can 
be used which is more fundamental but at the same time more abstract 
and less easily grasped on first contact. In the more fundamental pro- 
cedure the relationships among the operators themselves, without specific 
reference to wave functions, are made the central part of the theory. In 
fact, starting with the relationship 

px — xp = ft/i, (20) 

which is taken as fundamental in such treatments [whereas we would 
derive it here from (RC. 2), equation (18) of Section 14.3], and the expression 
for the energy of a harmonic oscillator of frequency &> 

X = $m)(p 2 + m 2 o> 2 x 2 ). (21) 

Dirac 4 derives all of the observable properties of the oscillator by dealing 
symbolically with the operators without introducing ty(x, t) at all. This 
typifies the emphasis which Dirac places upon the symbolic method and is 
a result which further illustrates the lack of fundamental significance of 
the wave functions, which, he writes, are introduced in his exposition 
"merely as an aid to practical calculation". Another treatment in which 
operators and their symbolic relationships are taken as fundamental is 
that of J. von Neumann. 5 

14.5 ENERGY EIGENFUNCTIONS 

Of particular value in the theory of quantum mechanics are the energy 
eigenfunctions. These are the particular wave functions for which the 

4 P. A. M. Dirac, The Principles of Quantum Mechanics, Oxford University Press, 1930, 
p. 123, or second edition, 1939, p. 133. 

6 J. von Neumann, Mathematische Grundlagen der Quantenmechanik, Julius Springer, 1932. 
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energy has a definite value. This requires that, if the energy is measured 
in TV similar experiments (just as x was measured in TV experiments in 
Section 14.3), the same value for K.E. + % = 3C will be obtained in each 
measurement. Now (RC. 3), equation (19) of Section 14.3, does not tell 
what values will be obtained in a large number of measurements but only 
what their average (!JC) will be. However, (RC. 3) can also predict the 
average of 3C 2 , denoted by (X 2 ), and by the same reasoning as was used 
with (x 2 ) and (x), we conclude that, if (3C 2 ) is equal to (DC), there can be no 
spread in the measured values for 3C. In other words, if the application 
of (RC. 3) to \|/ for "3C and DC 2 gives 

<3C) 2 = <3C 2 ), (1) 

then every observation will yield (3C). We shall find that we can prove 
from (1) that 

3C\|,-&|/ = 0, where & = <3C> = f y\>*3Cx\idx (2) 

which is recognized as Schroedinger's equation for the eigenfunctions dis- 
cussed in Section 5.2. This result is proved for the one-dimensional case 
considered in this chapter by making use of the Hermitian properties of 
3C, S>, and !JC — & (S is simply a number in these manipulations) in 
evaluating the integral: 

f\CK- &)ty\ 2 dx = f[QC - &)y]r]*(DC - &)tydx 

=fty*CK - S)[(3C - S)#fc 

= JVpC 2 - 3CS - S3C + & 2 ]y\>dx 

= <3C 2 > - <3C)S - S(tJC> + S 2 = <3C' 2 > - S 2 . (3) 

Now the extreme right-hand side is zero by (1) and the definition of 5 as 
(3C). Hence the integral in the extreme left-hand side is also zero, and 
the integrand | (X — S)\J/| 2 , which is always positive or zero, must be zero 
too, for all values of x. This establishes (2). 

The time dependence of an eigenfunction of Schroedinger's equation, 
satisfying (2), is very simple. We have, in fact, from equation (10) of 
Section 14.3 and (2), 

ih ^^lH = 3Ci|f (*, /) = Si|»(*, ') ( 4 ) 

dt 

which can be integrated at once to give 

+(*,/) =+(*,0)e-' w/ *. (5) 
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It follows that i|f*(*,/)*K*,/) = t*(*>°)t(*>°) s0 that the Probability 
density is independent of time and is therefore stationary in space. For 
this reason, energy eigenfunctions are also called "stationary states". In 
(5) we see that the complex phase of the wave function varies with the 
angular frequency, w = -S/h; or, letting u = -2irv, we have 

hv = 6. (6) 

This shows that the Planck relationship between energy and frequency 
holds for the frequencies in the eigenfunctions. 

An especially important feature of the eigenfunctions is that they form 
a complete set of Junctions in terms of which any solution of equation (10) 
of Section 14.3 can be expressed. In this way, they are analogous to 
Fourier components in terms of which any function periodic in time can 
be expressed. We shall not discuss the question of completeness 1 but will 
demonstrate the orthogonality property of the eigenfunctions and derive 
the expansion formula for an arbitrary wave function in terms of the 
eigenfunctions. 

Two wave functions, \J/i and ^ 2 , are said to be orthogonal 2 if they satisfy 
the equation 

If \J/i and v|/ 2 are eigenfunctions of DC, corresponding to two distinct eigen- 
values Si and &2, then they are orthogonal as may be proved as follows: 
Let 

DCx|/! = Sn|n, DCx(f 2 = 6 2 ^ 2 ; (8) 

then we have 

f^JC^dx = JvSaiM* = ^ftyi^x, (9) 

also 

fySX^sdx =f(Xtyi)*+2dx = &iJV**M*; (10) 

hence by subtracting left side from left side and right from right 

= (6 2 - so T iKSM*- ( n ) 

1 The proof of completeness is a rather advanced topic. See E. C. Kemble, Fundamental 
Principles of Quantum Mechanics, McGraw-Hill Book Co., New York, 1937. 

2 The word "orthogonal" implies right angle so that in function space (7) resembles a scalar 
vector product which vanishes if the vectors are at right angles. 
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By hypothesis, £>2 — Si ?* 0, hence the integral vanishes, which estab- 
lishes (7). It is evident that the same result could be proved for the 
eigenfunctions of any other operator. 

If we have a complete set of normalized eigenfunctions tj/i, v|/ 2 , • • • , then 
any wave function v|/ which satisfies the same boundary conditions as the 
eigenfunctions can be expanded in terms of them in the form 

\|/ = aityi + a 2 ^2 + ■ • • = £ a$s (12) 

where the coefficients «i, «2> • • • > are readily evaluated as follows: Multi y 
by any eigenfunction \|/ r * and integrate: 

/ \\> r *y\rdx = E «« / ty r *ty»dx. (13) 

The only non-zero term on the right side is a r f \|f r *i|f r ^.v = a T since \|r r is 
orthogonal to the other v|/'s. Hence we obtain the result, 

a r = fy\> r *T\rdx. (14^1 

Our orthogonality proof depended upon the eigenvalue for \\t r being 
different from that of \|/ s . In many cases two or more wave functions have 
the same eigenvalue which is then called a degenerate eigenvalue. If \J/ r 
and \|/ 8 are any two such t|;'s, which are normalized, but not orthogonal, 
then they can be combined to form two new wave functions 

\j/ a = a r y\t r + a^s and \|fj = 4-^r + ^ s v|/ s (15) 

and the coefficients chosen so that \|/ a and \(f& form an orthogonalized, 
normalized pair. 3 This procedure can be extended if more than two \|/'s 
are degenerate, and in this way any set of eigenfunctions can be brought to 
the prthonormal form, for which 

JVxM* = (J ^ ^) = «r.- (16) 

The symbol 5„(= 1 for r = s; = for r 5^ s), called the Kronecker delta 
symbol, is frequently used to represent the right side of (16). 

It is evident that, whenever the set of i|> s are orthonormal, the expansion 
coefficients in (12) are given by (14). 

8 For example, + r and +„ — ( I +,*+,<£* 1 <|» r are orthogonal and the latter is normalized 

/ I r |2\h 

by dividing by ( 1 — I +/+«(/* ) • 
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14.6 SOLUTIONS FOR A SIMPLE "ATOM", TRANSMISSION 

LINE ANALOGUE 

We shall illustrate the theory of energy levels for an electron in an atom 
and in a crystal by treating in this section a simple one-dimensional model 
of an atom and in the next section a periodic array of such atoms. The 
atom model to be discussed is represented by Figure 14.2(a) and consists 
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Fig. 14-2 — Potential Wells Representing Simplified One-dimensional Atoms. 

(a) The symmetrical well used in the text. 

(b) An unsymmetrical well. 

simply in a depression of the potential energy for the electron. This 
particular shape is frequently referred to as a square potential well. Our 
problem is to find solutions of Schroedinger's equation for an electron in 
this potential field, 

h 2 d 2 ty(x) 
2m 



3GK*) = 



dx> 



+ suooiK*) = SfO), 



(la) 



%(*) = 0, |*| < b, %(*) = ID, |*| > i, 



(lb) 



subject to the boundary conditions that x|f (#) must vanish at x = + °° 
and — oo, corresponding to the limiting case of boundary condition (1) 
for Section 14.4. We shall find that for any pair of fixed values of the 
depth 19 and width 2b of the well, there is a definite set of solutions which 
give wave functions in which the electron is bound to the well. 

We shall first show that for a symmetrical well, the wave function i|/ 
may be chosen to be either an "odd" function of x or an "even" function. 
This is done as follows: Suppose x|/i(x) is a solution of the equation with 
energy S 1} then 

\|/ 2 (#) = tyi(-x) (2) 

is an equally good solution, since the well is symmetrical about x — 0, and 
has the same energy Si. Now consider 

g{x) = t|/!(*) + \\t 2 (x) (3a) 

u(x) = \|/i(#) — \|>2(*)- (3b) 
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If \|fi is expanded in a power series in x, all of the odd terms will cancel 
out of g and all of the even terms will cancel out of u. Evidently 

g(*) = g(-*0 and u(x) = —u(—x). (3c) 

The symbols g and u correspond to the German gerade or even and un- 
gerade or odd. Unlike the symbols x and «j> introduced below, g(x) and 
u(x) are considered to be defined for all values of x. We shall next obtain 
explicit expressions for g(x) and u(x) by first finding solutions to equation 
(1) inside the well and outside the well individually and then by joining 
these together at x = -\-b and x — —b. We shall find that eigenfunctions 
satisfying the boundary conditions can occur only for certain values of the 
energy. 

In the well, the wave equation is 

d 2 y\i 1m 

dx 2 fi 



2=-T^^t (4) 



for which the even and odd solutions are 

Xg(#) = cos (2mS/h 2 ) H x -b < x < b (Sa) 

X„(*) = sin (2m&/h 2 )*x -b < x < b (5b) 

where the symbols Xg an d Xu ar e simply abbreviations and are defined 
only for \x\ < b. 

Outside of the well, the wave equation is 

A|; 2»i(TD - S) 

dx*~ = K 2 +• (6) 

This equation has a sinusoidal solution for & > TD and does not vanish at 
infinity. Such solutions can be combined into wave-packets and used to 
represent free particles moving in space outside of the potential well. 
They do not, however, represent the bound states which we wish to con- 
sider. If & is less than °IP, the particle does not have sufficient energy to 
escape from the well, even though its wave function, as we shall see, does 
not vanish outside of the well. For this case the possible solutions of the 
equation are 

<(>_ = exp [2wCffl - &)/h 2 ]*x (7a) 

4>+ = exp -[2w(T8 - &)/h 2 ]*x (7b) 

where the positive square root is to be understood by [2m(W— E)^/h 2 ]^, 
the <j> symbols, introduced for convenience, are defined only for |,>q > b. 
The solution <|> + increases without limit as x — -> — o° and cannot be used 
as a solution for negative values of x; it can be used for positive values 
of x since it vanishes at x = + oo ; for this reason its subscript is a plus 
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sign. Because of the symmetry properties of g and u, we need consider 
only the solution for x > since the solution for x < can readily be 
obtained from it by replacing x by — x; we shall illustrate this below. 

We shall first look for solutions of the even type which are valid for 
x>0. These must evidently be of the form 

g = ^Xg(*) = A cos (2«6/ft 2 ) H *, -b <x <b (8) 

g = B++(x) = B exp -[2m(°® - &)/h 2 ]*x, b < x. (9) 

How should these be joined at x = b\ The requirement is that both g 
and dg/dx must be continuous functions of x at b. This follows directly 
from the differential equation which states that in any infinitesimal inter- 
val 8x the change in dAf/dx is given by 

\_dx J Jb-(sx/2) ihr 

Since -if and [^(x) - S] are both finite, although ^(x) is discontinuous 
at x = b, the integral vanishes as Sx —»• so that dAt/dx is continuous at b. 
It is even more obvious that \|/ must be continuous at x = b because a 
discontinuity in \|/ would require an infinite value for dif/dx. 

Thus if it is possible to choose A and B so that g and dg/dx are continuous 
at x = b, a wave function satisfying equation (1) and the boundary con- 
ditions at x = + °° and — °o will be obtained. Such a wave function, 
which is by definition an eigenfunction of equation (1), can exist only for 
certain eigenvalues of the energy which are obtained by requiring that (8) 
and (9) join properly. Continuity of g requires that 

J Xg (b) = B$ + (b) (Ha) 

and continuity of g' = dg/dx requires that 

AyJ{b) = B$ + '{b) (lib) 

so that dividing the second equation by the first we obtain 

±^ = i-^± at* = *. (He) 

Xe dx <j>+ dx 

In order to deal with this condition analytically, we introduce the symbols 



a = V2nfiI)b/h (12a) 



|3 = V2ot(1D - &) b/h = Va 2 - 0* (12b) 



6 = Vlribb/h. (12c) 
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In terms of these we have 
dx 



X* = - (*/*) tan 8 



(13) 



(14) 



so that the eigenfunction condition reduces to 

-0 tan = -jS = - Va 2 - 2 . (15) 

(We retain the minus signs so as to be in accord with the sign convention 
for y and |i in the following section.) 
A similar treatment for u leads to 



cot = -0 = - Vc 



(16) 



Both and /3 depend on 6>, and the eigenvalue problem for & is thus to 
find the values of S for which the equation is satisfied. Figure 14.3 shows 
how this may be done graphically by plotting both sides as a function of 0, 
which is equivalent to & for this purpose, with negative values of £ cor- 
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Fig. 14-3— Solutions for Wave Functions in an Energy Well. (The light lines for 

wave functions gi and Ui correspond to ID = oo . The figure is not quantitative but 

shows correctly the qualitative features.) 



responding to imaginary values for 0. On this plot, /3 = Va 2 — 2 is the 
equation of a circle. The solid curves show —0 tan and the dashed 
curves cot 0. There are no solutions for imaginary corresponding to 
negative values of S. (Verify this by trying to draw g and u for neg- 
ative &.) 
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For the value «i = 2 shown in the figure, there are only two solutions 
and their wave functions are plotted on the right. If a is increased, there 
will be more solutions and the character of gi and u x will change. For 
a —* oo , the values of 6 are r/2 and ir and the wave functions vanish at the 
edges of the well and are zero outside. This is the "vanishing" boundary 
condition (I) discussed in Section 14.4. 
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As discussed in Section 14.2, the wave equation is analogous to an elec- 
trical transmission line in which the circuit constants depend on the energy 
parameter in Schroedinger's equation. From Table 14.1, line 5, we see 
that, if C the capacitance to ground per unit length is kept constant, then 
current in the line is analogous to the wave function »|r in our case. We 
may take the solutions to have the form 

i|r(*, /) = ty(x)e iut (17) 

and 

/(*, t) = I(x) cos at. (18) 

These lead to the equations of Table 14.1: 
d 2 I(x) 



-LCo> 2 I 



dx 2 

3- -?.«-«* 



1m 



(19) 



(20) 



For a solution like gi, this leads to a positive value of LCw 2 for |*| < b 
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and negative for \x\ > b. The negative value corresponds to taking Lie 
as negative which can be accomplished by replacing the series inductance 
by series capacitances C s as shown in Figure 14.4, where the distributed 
constants are represented as lumped. For such a circuit, energy stored 
in the inductances cannot be lost at the ends since the capacitance attenu- 
ator networks cannot transmit power. This is analogous to the region of 
negative kinetic energy, in which probability density is present but into 
which the electron can penetrate only a short distance. Letting TD — > oo 
corresponds to letting £<o — » — °o or letting the series capacitors C s ap- 
proach zero capacitance. For this case the inductive part of the line 
would be open-circuited at the ends so that / would vanish at x = ±£. 

14.7 SOLUTIONS FOR A SIMPLE "CRYSTAL" 

In this section we shall discuss the wave functions in a one-dimensional 
crystal, such as might arise from repeating the "atoms" discussed in the 
last section periodically with a lattice constant a as shown in Figure 14.5. 
When this is done, it is found that there are bands of energy in which the 
wave function has the Bloch behavior and other bands in which the ampli- 
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Fig. 14—5 — Periodic Potential Composed of Symmetrical Unit Cells. 

tude increases exponentially in one direction or the other of the crystal. 
While these latter cases may be of interest in connection with surface states, 
they cannot lead to wave functions in the interior of a periodic lattice nor 
be used to satisfy periodic boundary conditions and they, therefore, cor- 
respond to energies in forbidden energy bands. 

Several possible procedures may be followed in showing how the energy 
bands arise. One may start simply with an arbitrary finite periodic 
potential and prove by very general means that allowed energy bands must 
be present. 1 On the other hand, one may deal with a specific model and 
derive formulae and calculate the bands in detail. 2 We shall take an 
intermediate course and assume that the potential in each unit cell is 
symmetrical about the midpoint, the midpoints being located at x — 0, 

1 H. A. Kramers, Physica 2, 483^*90 (1935), summarized in F. Seitz, The Modern Theory 
of Solids. See Problem 13 of this chapter. 

2 Seitz, loc. cit.; V. Rojansky, Introductory Quantum Mechanics, Prentice-Hall; A. H. 
Wilson, The Theory of Metals, Cambridge. 
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±a, ±2# • • • , so that we may deal with wave functions of the g(^) and 
u(x) form in the cell at x = 0. We shall then show that the problem can 
be denned in terms of two ratios of slope to value, 



(dg/dx) 

S 

(du/dx) 



■ evaluated at x = a/2. (1) 



We shall then consider the general behavior of "y and \i as functions of the 
energy and discuss how the energy bands are related to the atomic levels 
of infinitely separated atoms. 3 
We shall first prove that any solution of the wave equation for energy &, 

can be expressed as the sum of two solutions if^ and \|/2 having the property 
Wx + a) =e^(*) (3) 

ty 2 (x + a) = e- iPa/ %(x) (4) 

where P is real in the allowed bands and imaginary in the energy gaps. 
We shall give a proof based on the mathematics of -y and |i, leaving the 
general poof of this important result, which is known as Floquet's theorem, 
as a problem. 

We shall introduce the quantity (unrelated to in any other section) 
by the equation 

20 = Pa/h so that e iFa/n = e 2 * (5) 

and will shortly obtain an equation for /3 in terms of -y and |i. It will be 
found that there are three possible ranges for j3: 

P is real and —ir/2<p< +tt/2, allowed band; 

ft is imaginary, forbidden band; 

/} = (V/2) + an imaginary number, forbidden band. 

For the allowed band, the complex phase changes by less than 180° from 
one cell to the next and the absolute value is the same in every cell. For 
the other two cases the wave function decays exponentially in one direction 
or the other, like <)>+ and <j>_ of (7a) and (7b) of Section 14.6. The edges 

3 The problem of determining the "Surface States Associated with a Periodic Potential", 
namely, wave functions which decay away in both directions from the surface of a finite 
crystal, is conveniently treated in terms of 7 and p; W. Shockley, Phys. Rev. 56, 317-323 
(1939). 

4 See Whittaker and Watson, Modern Analysis, Cambridge, 1927, Section 19.4. 
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of the allowed bands come at energies for which real values of j3 are pos- 
sible. The equation for j3 is obtained as follows: 

In the cell at x = 0, we may write any wave function satisfying equation 
(2) in the form 

i|n = Ag(x) + Bu(x). (6) 

Furthermore, we require that the value of \|fi and dtyi/dx both be mul- 
tiplied by exp (iPa/h) = exp (2;/3) as x varies from — a/2 to +a/2. If 
this is true, the wave function will start at the left edge of the cell "1", 
centered at x = a, with exp (2/jS) times its slope and value at the left edge 
of cell "0". Since the wave equation is' second order, the solution is 
determined (for a fixed value of S) if slope and value are given; conse- 
quently, the value of the wave function in cell "1" is exp (2*0) times its 
value in cell "0". The same relationship will then hold for cell "2" with 
respect to "1" and so on; therefore, equation (3) will be true. Since 
x = is the center of symmetry of the cell, we have from the symmetry 
properties [(3) of Section 14.6] of g(^) and u(x), 

iJri(-«/2) = Ag(-a/2) + Bn(-a/2) = Ag(a/2) - Bu{a/2) (7a) 

i|n'(-«/2) = Ag'{-a/2) + Bu'{-a/2) 

= -Ag'(a/2) + Bo.' (a/2). (7b) 

We shall hereafter omit the argument a/2, except where confusion would 
result, it being understood that g, g', u, and u' are all evaluated at -\-a/2. 
We next write 

*|ri(«/2) = Ag + Bu = % 2i ^{-a/2) = e 2i ?(Ag - Bu) (8a) 

i|n'(«/2) = Ag' + Bu' = e 2 %'(-«/2) = e 2i *(-Ag' + Bu.'). (8b) 

These can be rewritten as 

(1 - e 2ifl )gA + (1 + e 2ifi )uB = (9a) 

(1 + e 2if) )g'A + (1 - e 2i/3 )u'5 = 0. (9b) 

These equations can be solved for A and B if and only if the determinant 
of the coefficients vanishes. This gives 

(1 - e^V ~ (1 + e 2l '<Wu 

= 4e 2i ^[- sin 2 /3gu' - cos 2 /3g'u] = 0. (10) 

The square bracket must vanish, which requires that 

tan 2 j8 = -g'u/gu' = — y/(i. (11) 

This equation can be solved by a real value of $ if and only if •y/li is nega- 
tive. Furthermore, if ^ is a solution so is — j8 and so is /3 + mc where n 
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is an integer. Thus if P = 2jih/a is one solution, so are 

P + Irk/ a = P + h/a, P + 2h/a, P + 3h/a, etc., (12) 

corresponding to the periodicity discussed at the end of Section 5.5. The 
reason is now seen to be that P enters the theory through equations (8a) 
and (8b) in the form of the factor exp (iPa/h) which is unaltered by adding 
integral multiple of h/a to P. 

The fact that the solutions occur in P and — P pairs shows for this case 
that the quantum states corresponding to P and —P in the Brillouin zone 
have equal energies. This result was used in a three-dimensional form in 
Sections 6.4 and 8.4 to show that the current is zero for a full Brillouin 
zone and a partially filled Brillouin zone under conditions of thermal 
equilibrium. The three-dimensional result is readily obtained and is 
stated as a problem at the end of this chapter. 

No matter what the value of y/\i is, however, there will always be two 
solutions which give rise to two functions of the form given in equations (3) 
and (4). Since the wave equation can have only two linearly independent 
solutions for a given energy, any solution can be expressed in terms of 
their sum. A special case occurs if P — or h/2a is the solution, for in 
this case equations (3) and (4) become equivalent. For this special case, 
one solution is of type (3) and has either the same value in every cell or 
varies from cell to cell simply by being multiplied by —1. This solution 
represents the limiting case obtained by adding \\r + p and ty—p so as to get 
a cosine-like function in the energy band or a cosh-like function in the 
forbidden band. The other solution is not of type (3) or (4) and repre- 
sents an intermediate case between a sine-like and a sinh-like function. 
Its value increases not by a factor but by an additive constant in going 
from one cell to the next so that it satisfies the condition 

\\i(x + na) = r\t(x) + nC (13) 

where C is a constant. The actual construction of these solutions is simple 
and is presented in a problem. 

For positive values of y/\L we must distinguish between two cases: 
If y/\L < 1, then the solution is j3 = ±«x which leads to 5 

- tanh 2 x = — y/|i (14) 

which can always be solved for a real value of x- For this case the factor 
from one cell to the next is 

e 2tf = e ±2 X (15) 

so that one possible solution builds up in the +#-direction and the other 
decays. For y/p. > 1, the solution is /8 = ±»x + *"/2; this yields the 

5 X is unrelated to \ of other sections. 
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— coth 2 x = ~y/v- 



and the decay factor is 



e 2t|3 _ e ±2x+Tt _ 



,±2X 
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(16) 



(17) 



so that the solution decays as for equation (15) but alternates in sign from 
cell to cell. 

We shall next describe how the energy bands arise from the atomic 
energy levels of the model of the last section as the atoms are brought to- 
gether from infinite separation (that is, as a varies from large to small 
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Fig. 14-6 — Dependence of g(x) upon Changes in the Energy Parameter in Schroe- 
dinger's Equation. (Not exact.) 

values), the method of attack being graphical in terms of the •y and (i 
curves. In Figure 14.6 we show the g(x) function for its "atomic" eigen- 
value S g i, and for two slightly differing energies. For S gl , the value of 
■y is independent of a and has the value, deduced from <j> + of equation (7b) 
of Section 14.6, 



d$+/dx 



= -[2m ("ffl - S,i)/ft a ] 



2lH 



(18) 



For & < S gl , the wave function is not an eigenfunction because the 
cosine function in the potential well does not join properly to the expo- 
nentially decaying function <|> + but requires some of the "building-up" 
function <j>_. For large values of x, the <(>_ part dominates and the value 
of y is positive with a value of 

7 = +[2»C® - &')/&]*■ (19) 

A similar situation holds true for S" > S gl , the only difference being that 
<)>_ has a coefficient of the opposite sign, a change which does not affect (19). 
The resultant behavior of -y as a function of & is shown in Figure 14.7. 
For a large spacing such as a 3 , the value of y is given by equation (19) 
except for very narrow ranges of energies centered about S g i and about 
S g2 , it being assumed that the well has two even energy levels. If § is 
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decreased slightly from S g i towards S', g increases and dg/dx vanishes so 
that -y increases; for a slight increase in &, g(a 3 ) drops to zero, leading to 
■y = — oo ; and a further decrease leads to large positive values followed 
by a quick drop to the line of equation (19). For a smaller spacing, a^, 
the effect of energy changes is less abrupt and the range -y = to ^ = ±oo 




_. — -7 FROM 
EQUATION (19) 



1 FROM 

EQUATION (18) 



Fig. 14-7 — Dependence upon Energy of y - (dg jdx) /g at the Edge of Cell, x = all. 

(Not exact.) 

requires the range of energy &' to $". For all values of a larger than 2b 
of Figure 14.5, however, -y assumes the value of equation (18) when 
§ = S g i and a similar statement is true for other energy levels. 

Quite similar arguments apply to |i. 

We must next consider the relationship between -y and (i and the result- 
ing structure of the energy bands. In Figure 14.8 we show the structure 
for two values of a. On the left a = a Sy a large spacing. When S = & gl , 
the g function decays exponentially outside the well but the u function 
does not. Consequently, \l will have the positive value given by equa- 
tion (19) at this point. At & = & ul , \i has a behavior similar to -y at 
S g i. The remaining behavior of the curves may be deduced by a set of 
general theorems. We shall prove the most essential theorem here and 
rely on qualitative arguments for the remainder of the demonstration, 
leaving the other theorems as problems. 

Theorem: The y and (i curves never have a point in common, i.e. they do 
not cross or touch. 



Proof: Suppose the two curves have a point in common. Then for the 
particular energy and value of a concerned, 



g'(«/2)/g(«/2) = u'(a/2)Ma/2). 



(20) 



14.7] SOLUTIONS FOR A SIMPLE "CRYSTAL" 

Next consider the function 

g(*/2) 



+(*) = 



u(*/2) 



u(«). 
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(21) 



Then at x = a/2, the two functions i|f(#) and g(#) are equal and have 
equal derivatives. Since they satisfy the same second-order differential 




ENERGY 
BANDS 

ENERGY 
GAPS 



P VALUES O 



W 



(b)a = < 



Fig. 14-8— The Structure of the Energy Bands for the Two Lattice Constants of 
Figures 14-6 and 14-7. (Not exact.) 

equation, i|/(#) = g(x) for all x. But this cannot be true because at 
x = 0, u = but g^O. Therefore the assumption -y = (i leads to an 
absurdity. (The reader may generalize this by showing that "y = °° 
and ]L = oo cannot occur for the same value of &.) 

Another theorem, given in the problems, states that both d\Jd§> and 
d\i/d& are negative except when -y or \i = ± =° . 

By considering the situation near & gi for a — a 3 , we see that the allowed 
energy band arises from the region in which y is negative (that is, where 
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y/\L is negative). On the left edge of this band, 7 = and P = 0, cor- 
responding to the center of the Brillouin zone. On the right edge, -y = — °° 
and jS = 7t/2 and P = h/2a, corresponding to the edge of the Brillouin 
zone, and the wave function changes sign from cell to cell. For larger 
values of 6, -y is positive and larger than \l. Since the -y and |i curves 
cannot cross, -y remains greater than \l and y/\L is > 1 until the situation 
|i = and y/y. = 00 is reached. Between these two infinities in y/\L, 
the value of /3 is i\ + v/2 corresponding to the exponentially decaying or 
increasing type with alternating sign. In the next band, centered about 
& ul , P decreases from h/a to zero and, in the next gap, the exponential 
functions do not alternate in sign. 

The behavior just discussed is in accord with diagrams like Figures 5.9 
and 6.4 in which the energy curves have their minima alternately at 
P = and P = h/2a. 

It is also evident that the energy bands grow out of the atomic levels 
since S g i occurs in the negative -y region, and, at least at large spacings, 
fj. is positive there so that y/\L is negative, corresponding to an allowed 
band. 

From the theorem which shows that dy/d& and dy./d& are negative 
and that they must go through cycles from +=0 to — 00 , it follows that 
there must always be allowed bands of energy. This may be seen as fol- 
lows: At any point where -y = 0, either \i is finite or infinite. If |i is 
finite, then -y/ji changes sign when y = so that this point corresponds to 
the boundary point between a band and a gap. If ji is », then |t must 
change from — 00 to + 00 while y changes from positive to negative. 
For this case then, energy bands exist to both sides of the point, since 
■y/|i is negative on both sides. 

The case of the preceding sentence will actually occur for our model 
when a, the lattice constant, equals lb, the width of the well, so that the 
well bottom extends throughout the crystal. Under these conditions 
•y and |i become identical, except for a factor b, with the curves for 
bty'(b)/y\t(b) of Figure 14.3. These curves have tan 6 and cos 6 as factors 
and one goes through zero as the other goes to infinity and vice versa. 
Thus for the condition a = lb, there are no energy gaps and the energy 
spectrum is perfectly continuous, as it should be for an electron moving in 
free space. In this event, it is still possible, though somewhat artificial, 
to classify the wave functions as before, in terms of the Brillouin zone for 
lattice constant lb. If this is done, however, the energy curve becomes 
simply the free electron "parabola", 

g = l wt ,2 = pyim, (22) 

folded back on itself into the first Brillouin zone. (Figure 14.9 represents 
this case modified as discussed below.) 
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If a is slightly greater than lb, then very narrow walls will be left between 
the wells and the electrons will be almost free. However, when P is a 
multiple of h/2a, the Bragg reflection condition will be satisfied and the 
group velocity will be zero. This is produced in the -y, (i-diagram by a 
slight offset in either direction of the -y = point from p. ± » and vice 
versa. The effect on the free electron curves is shown in Figure 14.9. 




-3h/fca -ry6 -h/2a o h/fea h/a 3h/fea 

CRYSTAL MOMENTUM, P— *• 

Fig. 14-9 — Energy Bands for Almost Free Electrons. (Not exact.) 

(See the Problems for details.) The curves are characterized by the sharp 
curvatures at the band edges. These may be interpreted 6 as correspond- 
ing to a very small effective mass, since near these points 

v = d6/dP (23) 

changes rapidly with small changes in P so that 

dP 

where m is the free electron mass and \/m corresponds to the curvature of 
the free-electron parabola. 

The smallness of the mass suggests the following problem: Suppose an 
electron is in a wave-packet state corresponding to point S u in Figure 
14.9, the minimum of the third energy band. If a field is applied, the 
wave-packet will be rapidly accelerated and will move across the crystal 
in much less time than would a perfectly free electron initially at rest. 
How can this come about? Where does the electron in the third band 
get its extra energy ? The answer is that it had the extra energy already. 
State S t can be shown to correspond to the superposition of two running 
waves corresponding to P = +h/a and P = -hi a. Thus, although 

6 The basic theory for group velocity is given in Sections 15.2 and 15.4. 
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state £1 has no net electron current, it represents a state of considerable 
energy. The effect of the applied field is to shift the electron to state S 2 
by causing changes in the reflection conditions of the two running waves. 
As a result, all of the kinetic energy becomes associated with motion 
towards the right and the electron wave-packet is greatly accelerated, 
Once state ^2 is reached, however, the transfer of kinetic energy from one 
motion to another is complete, and the effective mass for additional accel- 
eration has its normal value. 

Ordinarily the masses for holes and electrons will be of the order of mag- 
nitude of the electron's mass. If the almost free electron approximation is 
used to represent the energy bands of the semiconductor, an erroneous 
impression is given of extreme smallness for the effective mass at the edges 
of the bands. The difference between the actual case and the almost free 
electron case comes about as follows: For the free electron case the zeros 
of y are paired with the infinities of |i and vice versa. The small disturb- 
ance in potential offsets these slightly and gives very narrow energy gaps 
with resultant very small effective masses at the band edges. (See Prob- 
lems.) For true valence-bond wave functions and conduction-band wave 
functions the situation will be quite different and, in general, there will be 
no simple relationships between y and |A, or rather their three-dimensional 
analogues. At atomic spacings the bands will spread by energies com- 
parable to the binding energy, that is, to a few electron volts, and the 
energy gaps will be of the same general order. From dimensional argu- 
ments we thus expect the effective mass to satisfy approximately an equation 
of the form 

AS = P ma , % 2 /2m eif = h 2 /Sa 2 m ell . 

Since AS = 0.1 atomic unit and h/a = 2irh/4a = (ir/2)h/a = (ir/2) 
atomic units of momentum, 7 this leads to 

m ei{ = (tt/2) 2 /8 X 0.1 = 3 electron masses. (25) 

This argument is intended to show only that effective masses of the order 
of the electron's mass should be the rule rather than the exception in well- 
bonded semiconductors. 

The discussion of the number of eigenfunctions in each energy band for 
periodic boundary conditions given in Chapter 5 is directly applicable to 
the wave functions of the one-dimensional periodic potential. The prob- 
lem of counting for the case of an even number of unit cells involves the 
equivalence of the wave functions for P = h/2a and P = —h/2a; for this 
condition there is only one solution which can satisfy periodic boundary con- 
ditions since the other solution varies as described in equation (13) and 
Problem 2. 

7 See Appendix A. 
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14.8 BLOCH FUNCTIONS FOR THREE DIMENSIONS 

In this section we shall derive Bloch's theorem that the wave functions in 
a crystal are of the form 

*|f(r) = e iP ' r/ V(r) (1) 

where u P (r) has the periodicity of the lattice. 1 We shall restrict the proof 
to a crystal with periodic boundary conditions, for which case P is a real 
vector. We shall also verify in general that the volume of the Brillouin 
zone is h z /V a where V a is the volume of the unit cell and that the density of 
allowed points is VJt? where V A is the volume of the crystal, and hence 
that the number of states (neglecting the spin) in the Brillouin zone is 
V A jV a = N where N is the number of unit cells in the crystal. 

The basic property of the crystal is that it can be translated by certain 
vectors so that after translation each atom has been moved into the position 
occupied by an equivalent atom. If the potential energy of the electron 
is "^(r), then for a permitted translation, for which we shall use the symbol 
R or R a in this section, the relationship 

GU(r + R) = 6U(r) (2) 

must hold. 

The set of allowed translations R forms a lattice, which is defined as 
follows: Take any point in space as the origin and consider all points dis- 
placed from it by vectors of the form R. The array so obtained is the 
lattice corresponding to the crystal structure. (In Figure 1.3, if a corner 
of the large cube shown is chosen as the origin of the lattice, then the atoms 
on the faces are at lattice points; the atoms on the interior, however, are 
not at lattice points.) The R's can always be described in terms of a set 
of primitive vectors. 

flij <*2> o 3 = primitive vectors. (3) 

The primitive vectors themselves must be vectors of the lattice and, 
furthermore, it must be possible to express every vector R of the lattice in 
the form 

R = audi + a 2 a 2 + a 3 a 3 (a's are integers) (4) 

where the a's are positive or negative integers. The primitive vectors 
define a parallelepiped, of which they are the edges, and this parallelepiped 
has a lattice point at every corner. Furthermore, it can have no lattice 
point except at corners because, if it did, in the set of a's for this point at 
least one a would lie between and 1 in contradiction to (4). From this 
reasoning, it follows that there is exactly one lattice point for each parallele- 

1 This is the customary use of U in this book, u for ungerade is restricted to Sections 14.6 
and 14.7 and the Problems. 
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piped having edges a u a 2 , a 3 . Furthermore all space can obviously be 
filled by repeating these parallelepipeds by translation of the form (4). 
For this reason, the parallelepiped is referred to as a unit cell. The crystal 
can evidently be divided into such cells, each of which will have'the same 
internal structure and it will be impossible to divide these cells further into 
equivalent parts which can be made to coincide by translation. 
The volume of the unit cell is 

V a = a x - (a 2 X a 3 ) = a 2 ■ (a 3 X a t ) = a 3 • (a x X o 2 ) (5) 

where it is assumed that a u a 2 , a 3 form a right-handed system. (If they 
do not, a new set with a/ = — a x can be chosen.) From the previous dis- 
cussion, it is evident that V a is the volume per lattice point and must, 
therefore, have the same value for every set of primitive vectors. In 



£7 ' ' 




Fig. 14-10 — Two Sets of Primitive Vectors in a Two-dimensional Lattice. 

Figure 14.10 we show two ways of selecting primitive vectors in a two- 
dimensional lattice. The reader should verify that a x and a 2 can be 
expressed in form (4) in terms of a/ and a 2 and vice versa. 

We now suppose that the wave functions satisfy periodic boundary 
conditions, the periods being 

A x = N x a u A 2 = N 2 a 2 , A 3 = N 3 a 3 , (6) 

so that 

ij,(r + A) = f(r), (7) 

where A is any linear combination of the Ai's with positive or negative 
integers as coefficients. The volume of the crystal, denoted by V A , is 
related to V a as follows: 

V A = A x - (A 2 X At) = N!N 2 N 3 F a = NV a . (8) 

Suppose v|f (r), satisfying the boundary conditions of equation (7), is a 
solution of Schroedinger's equation with eigenvalue &. (The existence of 
such solutions can be proved without too much difficulty.) Then it is 
evident that 

W) =M r + R ) (9) 

is also a solution for the same value of & since it simply represents displacing 
\|/ by a vector — R and the values of ^(r) and 6 U(r + R) are equal. 
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Our problem is to prove that if \Jf 0) exists, it can always be expressed as a 
sum of one or more Bloch-type functions which satisfy the condition 

$ B (r + R)= e 2 "™ <j, s (r) (10) 

B = P/h (11) 

and are eigenfunctions with energy &. This means that for the case of periodic 
boundary conditions it is unnecessary to consider functions other than those 
of the Block type, since all solutions of the problem can be expressed in terms 
of the latter. 

In order to deal with this problem we introduce two additional sets of 
vectors gt and G;, defined as follows: 

Gt = a 2 X a 3 /V a , G 2 = a 3 X aJV a , G 3 = a x X a 2 /V a . (12) 

The Gi's are called the reciprocal vectors of the crystal lattice. The vectors 

gi = A 2 X A Z /V A , g 2 = A 3 X A X IV A , g s = A x X A 2 /V A , (13) 

are reciprocal vectors of the periodic boundary periods and have no special 
name. It follows from these definitions that 

a t ■ Gj = 8ij - Ai ■ gj (14) 

and also that the volumes in G-space, or reciprocal space, of the paral- 
lelepipeds with edges G» and gi are 

V G = G 1 G 2 X G 3 and V Q = g t ■ g 2 X g 3 . (15) 

It also follows that there are inverse relationships of the form 

0l = G 2 X G 3 /F G , A x =g 2 X g 3 /V g , (16) 

as can be proved by noting that fiti satisfies ai • Gj = b\j and that this 
uniquely determines a vector a\. From the vector theorem 

u X (v X w) = (u • w)v — (u • v)w (17) 

it follows that Oi • a 2 X o 3 when expressed in terms of the G's reduces to 

V a - (G 2 X G 3 ) • [(Gs X Gx) X (G, X G 2 )\jV G A 

= (G 2 X G 3 ) ■ {[(Gs X GO • G 2 ]G X - [(G 3 X G x ) • G^}/^ 3 

= V G 2 /V G * = \/V G , (18) 

since (G 3 X G\) ■ Gi vanishes. A similar relationship holds between 
V A and V g : 

V A V g = 1. (19) 

There are JV distinct translations subject to the periodic boundary con- 
ditions and these may be written as 

R a = aiOi + a 2 a 2 + a 3 a 3 (20) 
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where the a's are integers running from to Ni — 1, N 2 — 1 and N3 — 1 
respectively. There is no need to consider R's with a's larger than these 
because such an R, with ai = Ni + 2 for example, would produce the same 
effect as R' = R— A x which has <x/ = 2 lying in the range 0<a/<iVi— T. 
There is a similar set of 2?'s, also yV in number, defined by 

By = 7101 + 7202 + 7303, < 7; < Ni - 1. (21) 

We shall show that these B's are the appropriate B's for the Bloch functions. 
In terms of the set (20) of R a and set (21 ) of B y we shall next define two 
sets of functions generated from tyo(r) by translations and in terms of these 
new functions we shall prove that \|fo (r) can be expanded in terms of one or 
more functions of the Bloch type. The new functions, denoted by \(f a 
and <J) T , where a and 7 are triplets of integers of the form (20) and (21), are 
denned as follows: 

i|r a (r) = *|r„(r - R a ) (22) 

and 

+ y = i\r* E e 2 '*^ q a = Z S ya ^ a (23a) 

a a 

where the S ya quantities are matrix elements for the transformation 
relating the \|/ a to the § y and 

S ya = &-*#*<*■,•**. (23b) 

It is evident that each of the set of <{>'s is an eigenfunction for the same 
energy § since each i|/ a is such an eigenfunction. 
We shall next prove that 

$ y (r + R) = exp (2wiB y ■ R)$ y (r) (24) 

which shows that § y is of the Bloch type with P = 2rhB y = hB y . Finally, we 
shall prove that 

+« = IT* E exp (-2riB p ■ R a )fy = E Tad,, (25) 

which shows that any tyo, or y\r a obtained from it, can be expressed as a sum of 
the Bloch-type ty's. 

To prove that <|) 7 is of the Bloch type, we evaluate 'VN§ y (r + R) as 
follows : 

VN+y(r + R) = E exp (2xi5 7 • R a )^ (r + R - R a ) 

a 

= E exp [2iriB y ■ (R a , + R)]ty (r - R a >) 

a' 

= exp (2*iBy ■ R) E exp (2«'B 7 • R a ')tya'(X) (26) 

a' 

where we have replaced R a — R by R a > so that the sum over a' now 
extends outside the allowed set of a values. Each R a > outside is, of course, 
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equivalent to an R a in the set which differs by an A vector. Replacing an 
outside R a ' = R a + Ax, for example, by the inside R a does not change the 
sum because 

<?** rAl = exp (2«7iffi ■ A{) = exp (2«7i) = 1 (27) 

so that the exponential is unchanged. Furthermore, v|/ a = \|/ a ', if R a > = 
Ra + ^i. Thus a' may be replaced by a in the sum of equation (26), so 
that (26) becomes 

VN$ y (r + R) = exp (2mB y ■ R)VN^ y (r), (28) 

which is the desired theorem of equation (24) except for the factor iV . 
Next we shall prove that any \J/ £ can be expanded in terms of the fo by 
proving that \j/ e is equal to the sum 

AT* £ exp (-IxiBp ■ R e )fa = {-.IE exp [IriBp ■ (R y - /? e )]x|/ 7 , (29) 

(3 TV (3 7 

the latter equality resulting from the replacement of <j>0 by equation (23). 
The term in the exponent may be written as 

2iri(fix9i + /3 2 2 + 18303) • («i«i + a 2 a 2 + 0:303) 

= 2«[(/WiV 1 ) + GW^) + GW^a)]. (30) 

where the a's are integers (possibly negative) because of the form of R y 
and R e . For a fixed value of 7, the sum over /} is equivalent to a triple 
sum over j8i, fo, and fe and may be written as 

Z £ £ exp {2x/[08 iai /M) + 0WiV 2 ) + C8 8 «8/iV8)]} 

= {1 + exp [(2«« 1 /iV 1 )jSi] + • • • + exp [(2«« 1 /iV 1 )(iVi - 1)]} 

X {1 + • • • + exp [(2«a 2 /yV 2 )(iV 2 - 1)]} 

X { 1 + • • • + exp t(2Wos/iV3) (iV 3 - 1)]} . (31) 

The first term is Nx if ax = or a x = ±Nx. However, the latter condi- 
tion does not arise from R y — R ( . If ax is not zero, the first term vanishes 
as may be seen by writing it in the standard form for a geometric series; 

{1 - exp [(2*iax/Nx) ■ Nx]}/[1 - exp (2xi« 1 /iV 1 )], (32) 

in which the numerator vanishes, while the denominator does not. Hence, 
the triple sum over /3i, /3 2 , 183 vanishes unless ax = a 2 — «3 = 0, that is, 
unless R ( = R y . If R t = R y , the sum becomes NxN 2 Nz = N so that we 
may write equation (29) in the form 

N'X £ exp (-2WB0 • i? £ )^ = £ M»t = +.- (33) 

7 

This proves that i|/ 6 can be expanded in terms of the <j>j3 which was the 
basic aim set forth in equation (10). 
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Furthermore, for periodic boundary conditions characterized by A 1} 
A 2 , and ^3, Bloch-type functions <f> 7 with non-integral values for the coef- 
ficients ji are not allowed because, if 71 were non-integral, then we should 
have 

4> T (r + At) = exp (2«ti)«|>t(i') j (34) 

and, if 71 were non-integral, the coefficient would not be unity and $ y 
would not satisfy the boundary conditions. Thus all allowed B vectors, 
that is, allowed by the boundary conditions, correspond to integer values 
for the 7i s. 

Equation (33) also shows that the matrices Sp a of equation (23) and 
7\p of equation (25) are reciprocal since 

llTypSpa = 5 ya . (35) 

(9 

In addition 

T 7/3 = Sfi y * (36) 

so that the transformation is unitary, a feature which leads to simplifying 
results in many cases but which we do not use further here. (See Problem 
3 of Section 17.3.) 

Next we shall see how the Brillouin zone may be defined in terms of the 
Gi vectors and examine the equivalence of different B vectors. So far as 
their translation al properties are concerned we shall show that §b and <j>g' 
are equivalent if B and B' differ by integral multiples of the G/s, that is, 
by a vector of the reciprocal lattice of the G/s, the G»'s being the reciprocal 
vectors of the crystal lattice as defined in equation (12). The result follows 
at once because in 

4> s (r + R) = exp (2«S ■ R)$ B (r) (37) 

the exponential is unaltered by adding G\, for example, to B since 

Gi ■ R = Gi • (aifli + a 2 a 2 + a 3 a 3 ) = a x (38) 

so that the added exponent is a multiple of 2x/. Hence we say that two 
B vectors which differ by multiples of the Gj's are equivalent. Conse- 
quently, it suffices to consider only those B's lying in the unit cell having 
edges Gi, G 2 , and G3 and a volume 

G x ■ (G 2 X G 3 ) = V G = l/V a . (39) 

B's lying outside of the cell are equivalent to B's inside it. This cell con- 
tains just the set of N vectors B of equation (21), since B's with 71 = iVi, 
etc., are equivalent to those with 71 = 0, etc. The density of these allowed 
points in G-space is thus 

N/V G = NV a = V A (40) 
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which is identical, of course, with the reciprocal of the volume of a unit 
cell of the <7; lattice upon which the allowed B's fall. 
Since the P vectors are h times the B vectors, 

the density of allowed 1 _ „ ,, 3 



• • „ i = VJh*, (41) 

points in P-space J 

or 1VJJ? if the two possibilities for the spin are taken into account. This 
formula is the same as that derived in Section 5.5 for the simple cubic 
lattice. 

It is possible to translate the unit cell of G- or P-space so as to center it 
about the point P = 0; it will then still contain N distinct allowed B 
vectors. However, the unit cell is somewhat arbitrary because the basic 
di and Gi vectors can be chosen in several ways. The first Brillouin zone, 
however, does not have this degree of arbitrariness and is defined in the 
following way: Make a lattice in G-space by taking all points of the form 
n\G\ + ^2^2 + n 3 Gs for positive and negative integer values of the «'s. 
This array of points is called the reciprocal lattice. Assign to each lattice 
point that part of G-space nearer to it than to any other point. Then the 
space around the point at the origin is the first Brillouin zone. Its special 
importance is that for B vectors ending on its surface the conditions for 
Bragg reflection are satisfied. For the case of almost free electrons, the 
surface becomes an energy discontinuity, or energy gap, and the component 
of group velocity in a direction perpendicular to the surface vanishes when 
B ends on the surface. 

It may be shown (see Problem 15) that no two points in the first Bril- 
louin zone are equivalent. Furthermore, the volume of the zone is that 
of one unit cell in the reciprocal lattice. Hence, it must contain a complete 
set of N distinct B's. 

The construction of Brillouin zones is treated in considerable detail in 
a number of standard texts. 2 

Problems 

1. Discuss the relationship between the following manifestations of a 
common cause: For a large value of a, such as a% in Figures 14.6 and 14.7, 
(1) the energy bands are narrow, (2) the effective mass in the bottom band 
is large, and (3) the transmission of an electron wave-packet is small. 

2. Show that at the edges and center of the Brillouin zone, the solutions 
are of the form given in equation (13) of Section (14.7). For example, 

2 A. H. Wilson, The Theory of Metals, Cambridge at the University Press, 1936. N. F. 
Mott and H. Jones, The Theory of the Properties of Metals and Alloys, Oxford at the Clarendon 
Press, 1936. F. Seitz, Modern Theory of Solids, McGraw-Hill Book Co., New York, 1940. 
L. Brillouin, Wave Propagation in Periodic Structures, McGraw-Hill Book Co., New York, 
1946. 
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consider the lowest state of the first band corresponding to y = 0, or 
g' (a/2) = g' = 0. Show that one solution is 

t|fi(#) = Ag{x — na) for * in cell centered at * = na, 

which corresponds to P = 0, and that another is, similarly, 

4*2 (#) = «(* — na) + 2n[u(a/2)/g(a/2)]g(x — na). 

What happens if a multiple of tyi is added to \J/ 2 ? Find the corresponding 
solutions for other zone edges and show that they are transition types 
between cases for real P and cases for complex or imaginary P. 

3. Prove that the extreme energies in each band come at P = ±h/2a 
and P = and that the energy varies monotonically between these limits. 
Do this by using the relationship between velocity defined by v = d6>(P)/dP 
and current discussed in connection with equation (4) of Section 14.3, 
which may be written as 

im\_ dx dx j 

If v = 0, corresponding to a flat region in the 5 versus P curve, then show 
that 

dlnty dlnty* 
dx dx 

From this show that P is either zero or a multiple of h/2a. (It is thought 
by most students that this theorem has no three-dimensional analogue 
and that energy extremes may occur inside the Brillouin zone.) 

4. Show from Problems 2 and 3 that at the edge of the energy band 
the average current is zero and hence by the group- velocity formula that 
dS>/dP = at the edge of the band. 

5. Consider the -y, (i-curves for an "almost free" electron. For this 
purpose, recenter the unit cell in the midpoint of the narrow section which 
is ID high and a — 2b = 2c wide. The potential is then 

m(x) = TO |*| < e 

%(*) =0 c < |*| < a/2. 

To simplify the problem, let c — * and ID — > oo so that c°® = V remains 
constant. Prove that 

u(*) = sin (2m&/h 2 ) ii x |*| < a 

g(x) = cos [(2m&/h 2 ) H \x\ - a] \x\ < a 

where 

a = V2m/& V/h. 
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From these show that the lowest energy state (P = 0) has energy & = 
IV la, equal to the average potential energy throughout unit cell. Show 
that the energy gaps are W/a wide. Verify that the curves will have the 
shape of Figure 14.9. From this conclude that the effective mass is of 
the order of mF/aS where 6 is the free electron energy, corresponding to 
V = 0. Derive an exact formula for the effective mass at the edge of the 
zone. [The equality of the energy gaps is a consequence of letting c — ► 0; 
for a more realistic potential the gaps become narrower at high energies.] 

6. In Figures 14.7 and 14.8, cases in which & was less than ID were 
stressed. How will the -y and (i curves behave for 5 > ID ? It may be 
helpful to consider Problems 9 to 13 first. 

7. Prove that S(P) = 6>( — P) for the one-dimensional case for a 
periodic potential which is not necessarily symmetrical about the center of 
the unit cell. Do this by writing \|/p in the form t]/p = exp (iPx/h)u P (x) 
where u P (x + a) = U P (x). Extend the result to three dimensions. 

8. Show that, for a symmetrical potential, the function u P (x) of Problem 
7 can always be written as a real even function of x plus an imaginary 
odd function of x. 

Remark: The following four problems give the steps necessary to prove 
that the -y and |A-curves used extensively in Section 14.8 decrease mono- 
tonically with energy from + °° to — °° . The theorem that they do not 
cross was proved in Section 14.7. These features are all that is necessary 
to show that the energy band structure discussed in Figure 14.8 is a general 
phenomenon of a one-dimensional symmetrical periodic potential. Prob- 
lem 13 shows how the results can be extended to a general periodic 
potential. 

9. Given a periodic symmetrical potential ^(x) where %(#) = %( — x) 
and %(* + «) = ^x) which is finite so that < ^x) < IT, prove that, 
for S < 0, y and ]i (defined in equation (1), Section 14.7) are both > 
and that g(x) and u(x) have no nodes (zeros) in the unit cell. Prove that 
for S large enough, order of h 2 /a 2 m, g and u must have nodes in the unit 
cell and large numbers of nodes for & very large. 

10. Suppose that a wave function i\ii(x) which satisfies Schroedinger's 
equation for energy S in a bounded potential energy field, like that of 
Problem 9, has two consecutive nodes, at x = a,\ and x = b\ > a,\. 
Consider \|/ 2 , also having energy &, with a node at a 2 slightly less than a x ; 
prove that it has a node at i 2 < h- For this purpose, consider iiiC*) = 
[d-tyi(x)/dx]/-tyi(x). Prove that iji varies from + » for x = «i + to 
— oo for x = i>i — and is continuous except for the end points where it 
varies as T[i = l/(x — a{) and l/(x — h)- Prove that i^C*), corre- 
sponding to \|/ 2 , and iiiCv) are never equal. Hence prove that, since 
7)2 («i + 0) < i\i(ai 4- 0), ii2 must go to — <» for x < b\. 

11. Consider the function \|fi of Problem 10, which corresponds to 
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energy Si. Let ty 2 also have a node at a\ and be a solution for the wave 
equation for energy & 2 where S 2 is slightly greater than &%. Prove that 
the node of \\i 2 occurs at b 2 < h- 

Hint: Make a diagram showing i)i(x) varying from +00 at «i to — °° 
at b\. Prove that i\ 2 = i\i = l/(# — «i) as x — > a x but that r\ 2 (x) < 
Hi(#) for * — «i finite. Show that if b 2 > b u then i\ 2 (x) > f\i(x) for 
b x — x finite. Hence b 2 must be less than b x . In other words, increasing 
the energy decreases the distance between nodes (like decreasing wave 
length). It may be helpful to derive the equation 

for use in this problem. 

12. From Problems 9, 10, and 11 show that -y and (i curves, like 
those of Figures 14.7 and 14.8, must decrease monotonically with increasing 
energy. (For example, consider u(x) which has a node at x = 0, and 
suppose that the energy is so low that it has no node inside the unit cell. 
Then the method of Problem 9 shows at once that increasing & causes 
(X to decrease at the edge of the cell. The cases with several nodes may be 
treated by considering the effect of increasing & upon the node spacing 
and upon the incurve between the nodes.) 

13. Prove for a general (possibly unsymmetrical) periodic potential 
^(x + a) = G U(*) that the solutions are of the form 

y\ii(x + a) = \\fyi(x) 

i\r 2 (x + a) = X _1 \|f2(*). 

Do this by letting s(x) (s for slope) and v(x) (v for value) be the solutions 
which satisfy Schroedinger's equation and which take the values 

s = and ds/dx = 1 at x = 

v = 1 and dv/dx = at x = 0. 

Denote the values and derivatives at x = a by s, s', v, and v . Prove that 
for any two solutions <(>i and <j>o of Schroedinger's equation, 



dx 
and hence that 



[♦■£-♦■£]-<> 



s'v 



p' = 1. 



Show that the periodicity requirement on ^ = As(x) -\- Bv(x) leads to 
(X - o)(X - s') - sv = X 2 - X(t> + s') + 1 = 0. 

From this conclude that if the two roots are Xi and X2, Xx = 1/X 2 so 
that Xi and Xi — l are the transformation factors. Show that X is real for 
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\v + s'\ > 2 or else |\| 2 = 1 far \v + s'\ < 2. This proves there are no 
damped complex wave functions. Using Problems 9, 10, and 11 
draw curves for s(x) and v(x) versus x and show that their nodes are 
alternating and that, consequently, \v + s'\ < 2 is surely satisfied for 
certain energy ranges for any lattice constant a. 

14. Prove that if R u R 2 , and R s are three allowed translations of the 
lattice and if Ri ■ R 2 X R3 = V a , then these R's may be used as set of 
primitive vectors, equation (4) of Section 14.8. 

15. Prove that the surfaces of the first Brillouin zone (see the end of 
Section 14.8) are planes which are perpendicular bisectors of vectors from 
the origin to other points of the reciprocal lattice. Prove that no two 
points in the interior of the zone are equivalent. 

16. Consider a lattice in space with primitive vectors eti, a 2 , a 3 and 
reciprocal vectors G u G 2 , G 3 . Consider the family of planes defined by 

r-H = n, H = h^G x + h 2 G 2 + faG 3 , 

where r is a vector from the origin of the crystal lattice, and hi and n are 
integers. Show that any vector 

R = aiOi + a 2 a 2 + a 3 a 3 (as are integers) 

satisfies the equation for a suitably chosen n, so that all lattice points lie 
on planes for various values of n. Show also that, if the hi are relatively 
prime (have no common factor), then for every value of n there are points 
on the plane. Show for this case that the planes of lattice points are 
spaced l/\H\ apart. This problem shows how the reciprocal lattice vectors 
can be used to determine systematically the various sets of planes upon 
which the atoms of a crystal lie. 



CHAPTER 15 



THEORY OF ELECTRON AND HOLE VELOCITIES, 
CURRENTS AND ACCELERATIONS 

1S.1 GROUP VELOCITY FOR WAVE-PACKETS 

In Figure 15.1 two sine waves are shown. They are in phase at crest A 
of the upper wave and B of the lower wave. At other points they are out 
of phase and tend to cancel. If many more waves of about the same wave 
length are combined to make a wave-packet, there will be a high degree of 

-x OX y* A ►, 




IP 




Vp + tfVp 



Fig. 15-1 — Reinforcement of Two Waves of a Wave-packet. 

cancellation except within a certain region, whose size may be adjusted 
mathematically, keeping it centered on crests A and B. The individual 
waves progress with phase velocity v p . If there is any difference in the 
phase velocities of the waves, the point of registration will shift so that the 
group will move with respect to the individual wave trains. Thus, if the 
lower wave moves faster by an amount dv p , then crest D will catch up with 
crest C in a time t = 5\/8v p . If the wave-length difference is small, this 
expression may be expressed by the derivative d\/dv p . In this time, crest 
A has moved a distance v p t, but the position of the reinforcement or center 
of the wave-packet has moved back an amount X. Hence the velocity of 
the group is 



(v p t — \)/t = v p — \dv p /d\. 



(1) 



This shows that if the phase velocity is independent of the wave length, 
the group and phase velocities are the same, a result valid for electro- 
magnetic waves in free space, or for acoustical waves of long wave length 
in air. 
The expression for v g may be put into the desired form by suitably 
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manipulating the derivatives. We write 

v p = \v, dv p /d\ = v + \dvjd\ (2) 

which reduces v g to 

v g = v p - \v - \ 2 dv/d\ = -\ 2 dv/d\. (3) 

By using the relationship k = 1/X we obtain 

dk = -d\/\ 2 , v g = A/rf* (4) 

which is the form used in Chapter 6, with v = &/k and k = P/h. 

In order to treat the three-dimensional case, we consider waves of the 
form exp 2iri[k ■ r — v(k)t] which can be combined to form a packet 1 

+ (r, t) = f {exp2«[fe • r - v(k)t]}w(k)dk (5) 

where / dk represents integration over ft-spaoe and w(ft) is supposed to 

be real, positive, smoothly-varying function of k with values different 
from zero only near some particular value k . The largest value of \J/ for 
any t will thus occur for the value of r where the exponential is most nearly 
independent of k, so that the integral over k adds together terms all of the 
same complex phase. This condition will be best met by satisfying 

= 7 k [k ■ r - v(k)t] = r - V*(k)t (6) 

for k = ko. This leads to 

r = V k v(k)t, (7) 

which is equivalent to the expression for the components of group velocity 
given in Chapter 6. 

15.2 APPLICATION TO ELECTRON WAVES 

Two differences must be considered in applying the results just obtained 
to electron waves. In the first place, we must show that the results are 
independent of the choice of the zero point or origin of energy. Thus, for 
example, if a potential were applied to the semiconductor as a whole, all of 
the electronic energy levels should be thought of as shifting together. 
However, the internal behavior of the electrons should be unaffected. In 
the second place, the electron waves are not plane waves, but are modulated 
combinations of atomic wave functions and this difference must also be 
considered. 

The unimportance of the choice of the origin of energy is readily shown 
from the Planck relationship v = &/h. Changing the origin of energy is 

1 In Chapter 17, k corresponds to 2x/X rather than to 1/A. 
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equivalent to adding a constant frequency v to all values of v. Since only 
the derivatives of v appear in v g , this added vo has no effect. 
The electron waves of interest are the Bloch waves 



\\, P = u P (r) exp {2wi/h[P ■ r - &(P)t]}. 



(1) 



Here (l/h)P plays the role of the wave vector k in the preceding section. 
Now if the wave-packet is made up of waves of almost the same value for 
P, Uj»(r) will be approximately the same function of r for all the waves and 
may be factored out of the integral over P-space, thus giving for the wave- 
packet 



+ = up (r) fexp {2wi/h[P ■ r - S(P)/]jw(P)aP. 



(2) 



The same reasoning applies as before and, if w(P) is large only near P 0) 
v|/ will be large only near 

r = [V P &(P)U Po t (3) 

giving a group velocity 



f = lt T = [Vi>S(i>)]atp °- 



(4) 



15.3 AN ANALOGY FROM CIRCUIT THEORY 

To elucidate the relationship between group velocity, density, and flow 
we shall treat the case of an artificial line with lumped constant inductances 
and capacitors. The relationship we shall establish is 

power transmitted = energy per unit length X group velocity. 
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Fig. 15-2 — Electrical Line Used to Show the Relationship between Power Trans- 
mission, Energy Density, and Group Velocity. 

As has been explained in connection with Figure 6.1, this case furnishes one 
example of a general result: that rates of flow associated with wave 
equations may be calculated either by a density flow method or fpom the 
group velocity. 

Figure 15.2 shows the line and the notation employed. We attempt to 
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find a solution of the form 

I? — If g27ri(na/X— vt) __ 7/ ^2ri(nka—vt) /in 

where k = 1/X and na is the position of the wth capacitor along the line. 
(These equations have been written to conform to the quantum-mechanical 
convention, with terms of the form exp ( — *W) ; this convention introduces 
a minus sign into iuL and iuC.) Accordingly, we obtain the following 
equations for the currents: 

In = iV n - F n+1 )/(-iuL) = V n {\ - e 2 * ika )/(-i u L) (2a) 

I n -i = (K-i - V n )K-iaL) = -V n {\ - e- 2T * a )/(-/<o£). (2b) 

Equating the difference in these currents to that furnished to the capacitor 
gives 

-iuCV n = In-i -In= (V n /-iuL){% 2 * ika - 2 + e ~ 2 * ika ) 
= (VJ-iwL){e™ ka - e -" ka ) 2 

= (AV n /iwL) sin 2 irka (3) 

This can be solved for sin irka giving 

sin wka = co VLC/A = w VLC (4) 

from which we conclude that the line can transmit up to ■xv'VLC = 1 or 
v — \/tt VLC but not above, since higher values of v would require complex 
values of k. At the cutoff, sin irka = 1 and irka = ir/2 or 90° ; thus the 
phase shift is 180° between unit cells as discussed in connection with 
Brillouin zones. 

In the pass band, the group velocity is 

dv d sin irka a cos irka 

Ve = 7k = 7kWTc r= VTc ' () 

showing that the group velocity varies from a/V~LC at low frequencies 
(that is, for ka <€. 1 ) to zero at the cutoff frequency when irka = 90° and 
cos irka = 0. 

Next we shall calculate the power flow down the line and the energy 
density along the line. The rath capacitor has a voltage 

Real V n = Vq cos (2irnka — cat), (6) 

leading to an energy of 

{CF 2 /2) cos 2 (2wnka - a>t) (7) 

which has peak and average values of CV 2 /2 and CF 2 /4i respectively. 
The energy of the current /„ is similarly obtained from the amplitude of 
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I n - The complex form of I n may be rewritten as follows: 
In = V n {\ - e 2 " ka )/-ic*L 

= F n e" ka (-2i) sin irkaf-icaL 

= (2F n /uL)e" ka sin irka 

= (2F /coL) sin irka ■ e 2 "^"-^ + * ika (8) 

which is evidently a current of peak value 

(2Fo/ccL) sin irka, (9) 

with a phase lag :r£« behind V n , The energy stored in the inductance thus 
has a peak value of 

%L(4F 2 /cc 2 L 2 ) sin 2 irka = |Z.(4F 2 /co 2 L 2 )(w 2 ZC/4) 

= I W, (10) 

which is the same as that stored in the capacitor. Thus the energy of the 
line is \CV 2 on the average for each capacitor and each inductance or 

CV 2 /2a per unit length. 

If this energy flows with the group velocity, the power transmitted will 

be 

CF 2 cos^ka V 2 {C/L)* cos irka 

— a — (11) 

2a Vic 2 

A direct calculation of power flow looking into the inductance which 
carries /„ is obtained from the amplitudes and phase angle of I n and V n . 
This leads to an average power flow of 

\V peak • / peak • cos (phase difference) =\V§- (2Fo/a>L) sin irka ■ cos wka 

F 2 a, VIC 



cos wka 
wL 2 

F 2 (C/L) H cos irka 



(12) 



in agreement with the other form. 

Our conclusion, in this particular example, is that consistent results are 
obtained when power flow is calculated (1) directly from the conventional 
expressions for power input at one point on the line and (2) from assuming 
that the energy stored in the line flows along the line with the group 
velocity. 

15.4 THE PROBABILITY CURRENT AND BLOCH 
FUNCTION CURRENT 

In this section we shall present the theory of the current operator and 
evaluate the current flow for electrons in the Brillouin zone. 
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The operator relationships p x = (h/2iri)d/dx and 3FC = (ih/2ir)d/dt are 
the starting point. Consider a volume V in space bounded by a surface S 
in which the one-electron wave function \|/(.v, y, z, t) = \|/(r, /) satisfies 

Xty = («A/2ir)i|r. (1) 

Then the total probability current out of V is given by the rate of decrease 
of the probability that the electron is in V\ 

~ ^ fvW xd y dz = - \ f($*Xty ~ ^3C^*)dxdydz 

' 2 -f(i£)i^^ ~ +*V)**A (2) 

since 

W --■£-?* + e UL(x,y,z) (3) 



and the integrals with ^ cancel. Noting that V • (AVB — BVA) = 
AV 2 B — BV 2 A and letting dS be directed along the outward normal of S, 
we can reduce the integral to 

-ih 
4irm< 



- / V • (iff*V\|/- Wty*)dxdydz = r— r— . / (\I/*V\J/ - t|/Vv(;*) • ^S. (4) 

! «/S 2W2 27Ti «/s 



Thus the probability decreases as if there were a probability current of 

/ (prob) = — Hf*pyb + vlf(^)*] = (l/m) real part \\>*pty (5) 
2m 

flowing across S. [For a plane wave of momentum p , this gives 
(l/m)po\y\r\ 2 just as if the probability density |\J/| 2 were flowing with velocity 

po/m.] 

In order to gain insight into the nature of the current expression, it may 
be helpful to consider the following results: If v|f(r, t) is energy eigen- 
function, it takes the form \|/(r) exp (2T&t/ih) so that |\}r| 2 is independent 
of time. This is consistent with the result that for this case V ■ / (prob) 
contains (\|r*V 2 \J/ — \j/V 2 x|/*) as a factor which vanishes when \|/ is an 
energy eigenfunction. 1 For the one-dimensional wave function of Figure 
5.7(c), the node in |\|/| shows that at that point the current vanishes; 
and hence, since V • / = 0, / vanishes at all points. 

We shall next evaluate the current expression over one face of a unit 
cell and show that it reduces to the group velocity expression for average 
velocity. 2 Since / (prob) involves products of \\i and \|/*, it is evident that 
a wave function of the Bloch type gives the same current for each unit 

^hy? See Section U.S. 

2 H. Jones and C. Zener, Proc. Roy. Sec. London, 144A, 101 (1934). 
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cell since the changes in the exponential terms cancel. Suppose the unit 
cell has vector edges a, b, c. We choose the #-axis perpendicular to the 
(ab) face and evaluate the current across this face. This will obviously 
give us the ^-component of current. For this case, the Bloch function 
is a function of P and r, and S depends on P. Differentiating the 
Schroedinger equation with respect to P x gives 

or 



36 r - 2 

dP, 



In the following equations the right side of this equation will be manipu- 
lated so that it is expressed in terms of the probability current. Multi- 
plying by \|/* and integrating over the unit cell C gives 

(and since if 3C\|r = 6i|/, then 3Ci|»* = 6t|r) 

8ir 2 m JcV dP x dP x Y / 
%T?mJc y V dP x dP x T / 

= _4-f(Vv^-^v**y</s. (8) 

Z**mJsV v aP x dP x ^) K ' 

The last integral is taken over the six plane surfaces of the unit cell. As 
we shall see, the contributions on the faces not perpendicular to x cancel 
out, since the integrand is periodic, and even for the two (ab) faces, there 
is considerable cancellation. To evaluate the integral, we write 

i|r = exp [2iri(P • r)/h]u P (r) (9) 

d\|/ 2*7*. , [" .,_ w ,"| du P (x) 

dx dP x hdx^h*dx v \_ "A dP x 

From the form of di|//dP x it is evident that the integrand is periodic for 
cell edges a and b (which have no ^-components) so that only the (ab) face 
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makes a contribution and for it only the ^-component of V is needed. 
For the (ab) face, all terms are periodic except those with the factor x. 
Since x is larger by an amount c x for the face of larger x, the net contribu- 
tion is given by: 

dP x Jc U 2 mJabV h dx h ^ dx ) J 

Aittm Jab\ dx dx / 

= c x I Ix (prob) dxdy 

*Jab 

= c x a ■ ^[Average I x (prob) over (ab) face]. (12) 

Hence the average probability current over the (ab) face is given by 

^J fr^V/c x (a-b). (13) 

ar x "C 

This can be interpreted by saying that the average probability density 
I / -ty*tydV 1 / volume of cell 

flows across the x = constant face with an average velocity v x = d&/dP x . 
Similar results for the other edges lead to the conclusion that v = Vp&(P). 

15.5 SOME GENERAL PROPERTIES OF QUANTUM- 
MECHANICAL WAVE-PACKETS 

As we have discussed in connection with Chapters 7 and 8, the wave- 
packet concept of the behavior of holes and electrons is essential to under- 
standing the basic physics of transistor action and related topics in semi- 
conductors. This appendix has been written as an introduction to a 
number of properties of wave-packets, which, although well known, are not 
readily available in textbooks. We shall describe first a minimum-uncer- 
tainty wave-packet. We do not give its derivation, however, since this is 
available in references quoted later. After that, we shall develop some 
general methods of modifying wave-packets and shall finally apply these 
to a study of the wave-packet shown in Figure 6.2. 

One of the most publicized features of quantum mechanics is the Heisen- 
berg uncertainty relation. We shall discuss the meaning of this relation 
in terms of wave-packets. A wave-packet, like that shown in Figure 6.2, 
for example, has a certain extension in space. Accordingly, it has a certain 
expected mean square deviation for measurements of its position. Formu- 
lae for calculating these are given in equations (11) and (14) of Sec- 
tion 14.3. Suppose that by the application of these we evaluate x 2 — x 2 = 
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(x — x) 2 = (Aa?) 2 so that we can call Ax the uncertainty in position of the 
electron for the wave-packet. There will also be a spread in expected 
momentum values from the mean value p. If we for simplicity sup- 
pose p = 0, t hen the mean value of (p — p) 2 , called (Ap) 2 , is simply 
p 2 = 2m K.E.; we have shown in equation (9) of Section 14.4 that this 
quantity is positive unless \|r is constant — an impossible condition for a 
wave-packet. Straightforward calculations using the relationship 
xp — px = ih, equation (20) of Section 14.4, lead to Heisen berg's uncer- 
tainty relation, 

ApAx > h/2 (1) 

for any wave function. Thus, if the wave-packet is made more compact, 
so as to decrease Ax, then Ap must increase. Mathematically, this follows 
from the fact tha-t, if the wave function is made more compact, by simply 
compressing it along the ^c-axis for example, its derivative with respect to 
x will be greater and so will p 2 . 

It is possible to construct a wave-packet for an electron in field-free space 
which has the minimum value of ApAx for any particular time, say / = 0; 
however, it is found that thereafter the packet tends to spread. The 
equation for a normalized wave-packet, centered at x = 0, having p = 0, 
and having a minimum value of ApAx, is: 



exp 



[" 



i / a <- ^a (a + tut), 

\L-K)' i \a + tut) 71 



(2) 



where 

a = Ax for / = (3) 



h _ Ap 
2m Ax m 



(4) 



The quantity u has the dimensions of velocity. This equation can be 
derived by analogy with the diffusion equation, the Schroedinger time- 
dependent equation for the free particle being manipulated as follows: 



dt Y 2m \c) dt 2 



dti h a 2 \|/ a 2 Ur 

— - = — - — - — - = D — — (St 

dt 2mi dx 2 dx 2 P; 

where D = (h/2mi) has the dimensions of a diffusion constant. 1 The 

1 This method is discussed by S. Dushman, Elements of Quantum Mechanics, John Wiley & 
Sons, New York, 1938, p. 405. 
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more conventional method is to resolve the i\t(x, 0) minimum packet into 
momentum eigenfunctions, whose time dependence is simply exp (tp 2 /2mhi), 
and obtain the time dependence of \|/(.v, t) as a consequence. 2 

We shall next derive some general theorems pertaining to all wave- 
packets for free electrons, after which we shall remove the limitation 
p = 0, which holds for packet (2), so as to set it in motion and obtain the 
packet of Figure 6.2. 

First of all, we shall prove the time-reversal feature of Schroedinger's 
equation which shows that for any ty(x, t) another \|;, called ty-(x, t), can 
be found for which | v(r_ (at, t)\ 2 = \i\i(x, — 1)\ 2 . Thus \|f_ gives the same 
behavior for the probability density as does t)/ except that time is reversed. 
The proof is very simple: We denote the function obtained by differentiat- 
ing \\r(x,t) in respect to its second argument by x( x y0- We then have 

dilr 

ih — = ihx(x,t) — D^vjr^, t). (6) 

dt 



Let 

Then 



\|/_(at, /) = \\t*(x, — t). 

«*^fl. a *l!^.-« x . ,,_ fl . ,7) 



However, from the complex conjugate of equation (6), we have 

-ihX*(x, -t) = 3Ci|r*(*, -/) = !X\J/_(.v, /) (8) 

where we have used the relationship (3G|r)* = !JC(v|/*) which follows from 
the fact that 'DC has no imaginary terms. From this equation and (7), 
we obtain 

a^r = rx\|,_ (9) 

at 

which was to be proved. We shall show next that a wave-packet increases 
its value of Ax from its minimum value symmetrically for plus and minus 
time, a result which would appear unreasonable if Schroedinger's equation 
did not possess the time reversal feature. In particular, we shall prove 
that, for any wave-packet, Ax varies according to the equation 

~2 -2 

(A*) 2 = V — -2- (t - t ) 2 + (Ax) mia 2 (10) 

Im 

where /n is the instant at which Ax is a minimum. This equation shows 
that the packet spreads in space symmetrically to both past and future 

2 A very readable discussion of this procedure tqgether with a derivation of ApAx > h/2 
is given in L. I. Schiff, Quantum Mechanics, McGraw-Hill Book Co., New York, 1949. 
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times measured from t . This result is proved from the expression for 
(A*) 2 , equation (13) of Section 14.3, and the formula for the time deriva- 
tive of an operator, equation (11) of Section 14.4. We have 

dx p 

T, = ~~ (11) 

dt m v ' 

d (d \ Id 11 



/ m dt m th r ' 



ItWJ-mli'-mlkW-Kp)-** (12) 

since 3C = p 2 /2m and DCp = pVC = p 3 /2m. Next we evaluate dx 2 /dt as 
follows 

s*\7)j£;<"V-* v > 



(2) 2ihm 

= -w 

(3) 2ihm 



WO L ft\ i 

itoT\ 2xp + l)tom i * p + px) (13) 



where equality (1) is obtained directly from formula (11) of Section 14.4; 
equality (3) by simple algebra; equality (4) from the equation (20) of 
Section 14.4 /w = xp + (h/i); equality (2) from repeated applications of 
(20) such as 

p n x = p^px = p^ 1 (xp + -)= p»-i xp + ^-i 

= p n ~ 2 xp 2 + 2 - p"- 1 = etc. = xp n + n -. p n ~\ (14) 

Proceeding similarly for d 2 x 2 /dt 2 , reducing all terms containing p n x to xp n 
terms by (14), we obtain 

d 2 x 2 d (px + xp\ 1 

if = it \nr~) = m^ [(J)x+ xp)p - p2 ^ x + *pn 
= d?("* 3 + i* 2 + ^-«p 3 -^p 2 -^p 2 ) 

1 / 4ft\ . 2p 2 

Since dp 2 /dt = 0, it is evident that d 3 x 2 /dt 3 = 0. 

If the foregoing expressions are averaged in integrals of the form 

x = I \\i*x\\idx (see RC. 3 in Section 14.3) we obtain: 

d p 

— x = — = v (constant in time) (16) 

dt m ' v ' 
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d -* 1 -; ; - 

— x jl = — (px + xp) = xv (17) 

at m 



dt 2 m 2 



= 2d 2 (constant in time). (18) 



The higher time derivatives of x and x 2 vanish as do all time derivatives of 
p andp 2 . Since (Ax) 2 = x 2 — x 2 , we can write 

d , s9 dx 2 „ dx 
-(^.--a-.S-JS (19) 



dt 2 m 2 m 2 



These equations show that (A*) 2 is a quadratic form in /, the coefficients 
of which may be expressed in terms of values at t = (like x 2 ) as follows: 



(Ax) 2 = ^- t 2 + (xv - 2xv) t + x 2 - x 2 . (21) 



(A*) will have its minimum at the time, call it t m , for which 

^ (A*) 2 = 2 (Ap) 
dt m 



= -^- = 2 ^- t m + (Tv - 2xi>) . (22) 



This equation may be solved for t m and the minimum value of (Ax) 2 
obtained in the form 

(Ax) min 2 = —4~ t m 2 + (xv- 2xv) t m + x 2 - x 2 . (23) 

m 

In terms of t m and (Ax) m - la 2 , the expression for (Ax) 2 reduces to 

(Ax) 2 = ^4- (' ~ tm) 2 + (Ax) min 2 . (24) 

m 

The equations just derived show that the center of gravity of the packet 
moves with constant velocity v = p/m and that it spreads in space as 
expected for a spread in velocities (Ap/m). However, the Ax of the packet 
never decreases below a minimum value. This may be compared with the 
wave-packet of (2) of which the exponent may be rewritten to give: 

[ix 2 ut x 2 ~\ 

mx n - + 4 * ( * 2 + ^ " 4( * 2 + ^ nn 
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so that v|/*\J/ is 



!+(*> 0| 



CXP |_ 2(a 2 + u 2 t 2 )\ 



VIt (a 2 + u 2 t 2 ) 



2,2nM 



[1S.S 



(26) 



which represents a normalized Gauss error function probability distribution 
with (Ax) 2 given by 



(Ax)' = (a 2 + u 2 t 2 ) = (A*) 2 + 



\m / 



(27) 



where the values of a and u from (3) and (4) have been used. 

We shall next show that a wave-packet in free space may be given a 
Galilean transformation ; that is, the whole solution may be given a velocity 
of translational motion and remain a solution of Schroedinger's time- 
dependent equation. This result holds in general for any solution of the 
one-dimensional equation, 



1 d 

— p 2 ty(x,t) = ih-ty( x ,t). 



The necessary transformation is 



(28) 



(29) 



where the added velocity of motion is pi/m as may be seen from the fact 
that the new probability density is 

+ (*--/,/) » (30) 



!<t>l 2 = 



which is simply the old probability density being translated with velocity 
pi/m. It may readily be verified that <}> satisfies the time-dependent equa- 
tion. Writing <|> = exp (tA)\\t for brevity, we find, since p exp (iA) = 
[exp (/'^)j (pi +p), that 



1m 



P% = ~p[e iA Pi + e iA p]y\, 



JA 



1m 



[Pi 2 + Ipip + p 2 ty. (31) 



In evaluating ihdfy/dt, we must note that / occurs three times in <j> and that 
ih times the derivative with respect to / of the first argument in iff gives 
simply [pi/m)p^. Thus we have 



ih * = •« 

dt 



[f + ^ + f]*, 

\_lm m Imj 



(32) 



the derivative of the last argument giving (p 2 /1m)\\i, since \|/(#, /) satisfies 
Schroedinger's equation. Thus we see that (p 2 /1m)$ = ihd^/dt so that 
the packet can undergo a Galilean transformation, Q.E.D. 
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We shall next compare the values of 3C for 4> and \}r. We have 

3C =f+*X+Jx = f e~ iA ty y~ [pi 2 + 2pip + p 2 \i?dx 

= i J* ^^ + 2pip + p2] ^ dx 



= hi bi2 + 2pip +p2] = h [{pi + p)2]> (33) 

where the averages are taken over \|/. This represents simply the energy 
to be expected if the momentum distribution represented by \j/ were simply 
increased by pi. If p = 0, the added energy is simply that of the motion 
of the packet of mass m with velocity v = pi/m. From this it is evident 
that setting the packet in motion does not change (Ap) 2 : 

(Ap) 2 = av. over $ (p 2 — p 2 ) 



= av. over ty [(pi + p) 2 - (pi + p) 2 ] 

= av. over \|/ [p 2 + 2pip + p 2 — p 2 — 1p x p — p] 

= av. over \|r [p 2 —p 2 ]. (34) 

Since the probability distribution |<j> 2 | is simply that of |\}r| 2 set in motion, 
(30), it is evident that (j> has the same (Ax) 2 and (Ap) 2 values as does vj/. 
We can now apply this transformation to the minimum-uncertainty 
packet in order to obtain the wave-packet used for illustration in Figure 6.2. 
Applying (29) to (25) and introducing the notation 

pi/h = k, pi/m = v, x = y + vt (35) 

so that y measures distance from the center of the wave-packet, we obtain : 
$(x,t) 

(2,)*(« + ,«0« CXP ' L *V + ~2) + 4a(a 2 + u 2 t 2 )i (36) 



ex P L 4(a 2 + ~ 2 > 2 " I r ' -* ~ 2 ' 



From this we can see analytically several of the interesting results shown 

in Figure 6.2. Near y = 0, we see that pty = r k\\r = pity. However, 

near the leading edge or the back edge of the packet, corresponding to 
y = ±(a 2 + u 2 t 2 )^ respectively, we have 



. h 



i |_ 4«( 



lut "1 

a 2 + u 2 t 2 )*] * 



(37) 
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so that for ut ^> a, the momentum operator gives 

pty = (pi ± — J ty = (pi ± Ap)ty. (38) 

This variation of momentum between the front and back of the packet 
can readily be seen in Figure 6.2. 

1S.6 THE ACCELERATION OF AN ELECTRON WAVE-PACKET 
BY ELECTRIC AND MAGNETIC FIELDS 1 

In the text we stress the possibility of treating holes and electrons as 
particles which may be accelerated by electric and magnetic fields and 
scattered by irregularities in the crystal. The basis for this picture is that 
wave-packets for holes and electrons can be prescribed which move, to a 
good approximation, in accord with the classical picture. The relationship 
between velocity and momentum, however, is that discussed in Chapter 6 
and between force and rate of change of momentum that discussed in 
Chapter 7. We shall establish these results for an electron wave-packet 
in this section and consider the modification for holes in the following 
sections. 

The theorem which we shall prove is as follows: If we construct a wave- 
packet which represents a localized electron with a wave function corre- 
sponding to momentum P in an energy band, then this wave-packet will 
move with velocity v = V P &(P) (a result previously established in Section 
15.2) and will change its momentum P according to the formula 



P = (- e )[B + - c VXH~\- 



(!) 



This result is in accord with the discussion of Chapter 7. 

This theorem proves results which are partially derived in various texts 
in other ways. One of the simplest ways, although not rigorous, of reaching 
this conclusion is to make use of the group velocity formula and the 
principle of conservation of energy. 2 If the force on the electron is F, one 
then equates the power input to the electron to its increase in energy. 
This gives 

v ■ F = 4 &( p ) = p ■ VpS(P) = P ■ v 
at 

which is satisfied by 

P = F. 

1 The earliest references to proofs of the theorem derived in this appendix are Sommerfeld 
and Bethe, Handbuch der Physik, Vol. XXIV/2; H. Jones and C. Zener, Proc. Roy. Soc. 
London 144, 101-117 (1934); H. Frohlich, Theme der Metalle, Julius Springer, Berlin, 1937. 

2 F. Seitz and N. F. Mott and H. Jones follow this procedure in their books. 
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However, it is equally well satisfied by P = F + (l/|i;|)/'' X v or any other 
vector whose component parallel to v is also F. Furthermore, the later 



use of F = (■ 



-<?) \E -\ — y X -ff represents an additional assumption 

since the forces due to H do no work. 

A derivation of the acceleration law for electric fields using momentum 
eigenfunctions has been given by J. C. Slater. 3 A somewhat similar treat- 
ment including the effect of magnetic fields is given by A. H. Wilson. 4 
However, neither of these references treat wave-packet behavior explicitly 
and it is easier for our purposes to start initially with the wave-packet. 

The method we shall use is based upon obtaining an approximate solution 
for Schroedinger's equation for an electron in electric and magnetic fields 
and showing that this solution does behave in the predicted way. The 
treatment which we shall give is similar to one due to Houston, 5 except 
that we include magnetic fields and deal with a wave-packet. As an 
introduction we shall describe briefly the essential feature of Houston's 
method. We consider an electron in a wave function of the Bloch form 

\|/ P = exp [i(P ■ r)/h]\x P (r) 

where Up(r) has the periodicity of the lattice and is normalized per unit 
volume. 6 This wave function satisfies the equation 

\]tn p2+G[i (r) ] *' S ^ P = S {P) * p - (2) 

If an electric field E, producing a force — eE and a potential energy eE ■ r, 
is applied for time dt, the change in \|r is 

dtyp = j % dt 3ty P = | [S (P) + eE ■ rty P (3) 

so that at / + dt, \[r has become 

{1 - i[S(P) + e& ■ r]dt/h}ty P = exp {-i[&(P) + eE ■ r]dt/h\ty P 

= {exp [-i&(P)dt/h]} {exp [i(P - eEdt) ■ r/h]\u P (r). 

This shows that in time dt, the effective P for \|/ has changed by ( — e)Edt 
just as if P = (-e)E. Houston finds that Up also changes to U P _ eEdt so 
that the wave function as a whole varies according to P = (-e)E. Except 
for fields so high that in one lattice constant the electron gains an appre- 

3 J. C. Slater, "The Electronic Structure of Metals", Rev. Mo J. Phys. 6, 209-280 (1934). 

4 A. H. Wilson, The Theory of Metals, Cambridge at the University Press, 1935. 
6 W. V. Houston, Phys. Rev. 57, 184-186 (1940). 

6 The +p used in Chapter 17 is normalized for the volume inside the periodic boundaries 
and is 1/ V V times the vfp of this section. 
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ciable fraction of the energy required to jump the energy gap between 
bands, Houston shows that typ- e Et is a good solution. Our method will 
also consist of finding the change in time dt (which we shall express as a 
derivative) but we shall deal with a wave-packet and also include the 
effect of a magnetic field. 

In the presence of electric and magnetic fields, Schroedinger's equation 7 
for an electron with charge — e contains the scalar and vector potentials 
<p and A and is 



where 



\h ^ ~ Pa)2 + ^ + { ~ e)<p ] + = sty - fojt* 



(4) 



t>A=^-A. (5) 



The scalar and vector potentials give electric and magnetic fields E and H: 

E=-V v --?-Ai H = 7XA; (6a) 

c at 

the scalar and vector potentials are not unique and one may transform 
them by adding to <p a term { — \/c)dx/at and to A a term Vx without 
changing E and H, If we suppose that <p arises from a stationary system 
of charges and that A arises from changing currents and charges outside of 
the region considered, then the gauge transformation function x m ^Y be 
chosen so that V • A = 0. We shall suppose that x has been so chosen 
and shall use the resulting equations 

V • A = and p ■ p A = Pa • P (6b) 

in the subsequent analysis. 

In the presence of a magnetic field it is impossible to have wave functions 
which are eigenfunctions for the velocity operator r. In fact we find, by 
using pxX — xp x = h/i, that in the presence of a magnetic field the operator 
corresponding to r is no longer p/m ; and the new operator for t has such a 
form that no wave function can be simultaneously an eigenfunction for the 
three components of r. Using the commutation relation p x x — xp x = h/i 
and the expression (1 1 ) of Section 14.4 for the time derivative of an operator, 
we obtain 

r = (l/ift)(i3C - 3Cr) = (P - p A )/m. (7) 

For a uniform field with H = i z H z , a possible choice for A is i v H z x and for 
this it is evident that 

xy = t — t — + - HzX = yx + - H z (8) 

\_t dxj\_t ay c J ic 

7 See, for example, L. I. Schiff, Quantum Mechanics, McGraw-Hill Book Co., New York, 
1949. 
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so that x and y do not commute and, therefore, no wave function can be 
an eigenfunction for both x and y. This fact prevents dealing with in- 
definitely extended wave functions, such as Houston used, in the presence 
of a magnetic field; however, it does not interfere seriously with using 
wave-packets. 

We must next consider the problem of constructing a wave function 
which represents an electron moving in a magnetic field and having the 
attributes associated with \|rp, a wave function in the energy band for the 
case of no magnetic field. A clue to the solution is furnished by the gauge 
transformation of \|; which accompanies that of A and <f>. Thus if y\r satisfies 
the equation 

\jtn {P ~ Pa)2 +G{L+ ( ~ e)(p ] + = ih Jt^ 

then the wave function 

i|/' = e *' (- ' /e)x/ *ijf (9) 

satisfies the equation with 

A' = A + V X , <p' = <P--j t x, (10) 

c dt 

as the reader may verify by substitution. If we wish to center the wave- 
packet at r , then we may regard Pa (i"o> t) as arising in part from a gauge 
transformation with x = A{tq, t) • r = (c/ — e)pA(ro, t) • r. This reasoning 
leads us to investigate the properties of 

v|/ = e* Aorr/i S|f,(r) J (11) 

where Pao — pAo(t) = PaItqU), t\. It is evident (A) that y\r gives the 
same change density \^]fp\ 2 as does y\r P . We shall also verify (B) that, near 
r , the wave function (11) is nearly an eigenfunction of 3C with the energy 
appropriate to state P in the absence of a magnetic field, and, furthermore, 
(C) that the average value of the velocity operator r = (p — pA)/m, 
averaged over the unit cell at r , is simply v(P) = V P 6>(P) appropriate to 
an electron in state P. Thus near To the wave function (11) represents 
properly the state of motion described by P and v(P). After verifying 
assertions (B) and (C), we shall build a wave-packet based on (11). 

To prove (B), we evaluate 3C\|f, by carrying out the operations only on 
the first factor in equation (11): 

Wy=t*^/*^ + (- e ) v + ±(p A0 -p A )-p + -±-(p A0 -p A )*^ P (12) 

where we have used the fact that both V • Pa and V • Pao are zero in 
arranging the third term. At r = r , Pao — Pa is zero so that (12) gives 

3ty = e^^flFCo + (-*)«#!- = e*"^^) + (-«)«#p 
= [5(F) + (-«)«# 
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so that, at r , xjf is an eigenfunction with energy S(P) due to its state in the 
band plus (—e)<p due to its interaction with the added electrostatic po- 
tential. When r differs from r , there are changes in <p and, as we shall 
see, these affect z'ftifr just as — E ■ r did in Houston's method. Also there 
are changes in p A0 — Pa which produce the magnetic effects. The term 
in (Pao — Pa) 2 is negligible compared to the term in (p AQ — p A ), provided 
that the size of the wave-packet is not too large. We shall show this by a 
rough estimate of the two terms, using for the operator^ the average value 
for a thermal electron and writing the radius of the packet as n times the 
lattice constant a and taking A = i y H z x. The quadratic term is then 

2^|_— J =2 (wBa) > Where " = -^" (13) 

The linear term is obtained by writing |^[ = mv where v is the thermal 
velocity, thus obtaining 

1 eH z na 

mv = m(o>na)v. (14) 

m c v ' 

The ratio of (14) to (13) is v/wna = (v/a)/na, the latter form being 
(v/co = radius of circular orbit in field H z ) divided by (radius of wave- 
packet). It is physically evident that, if this ratio is small, the motion of 
the electron will consist of a circular oscillation which is smaller than the size 
of the wave-packet, and in this case the internal motion of the packet may 
be more important than the bodily motion. For a thermal electron, how- 
ever, v = 10 7 cm/sec and for a field of 10 3 gauss w = 10 10 sec -1 so that the 
radius of the orbit is 1CT 3 cm and the two radii are equal for na = 10~ 3 cm 
or n = 10 5 . We shall deal with packets much smaller than this and shall 
neglect the quadratic term. 

The value of t, averaged over the cell of r , is 

M f V- (P-Pa)WV 
(a) J ceiio T m T 

V/eell at r — ~ 

/ WdV 

J cell T ' 
( b)/ce„0*'*^ + ^-^^ (c) 

=v(P) (15) 

ty f \ 2 dV 



f 

J cellO 



where equality (a) follows from (7), (b) from the operation of p on the 
exponential factor of (11), and (c) from neglecting p A0 — p A compared to 
p and using the results of 15.4. Hence assertion (C) is correct, and the 
wave function \\i represents correctly the average velocity v. 
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Wave-packet. We next construct a wave-packet by multiplying \|/p by 
a weighting factor w(r, /) and a time factor exp [ — //(/) /ft]. We shall 
suppose that w and / may both be chosen to be real functions so that / 
contributes only to the complex phase and w simply gives the shape of the 
packet. We shall show later in equations (31) and (32) that such a choice 
gives a solution of Schroedinger's equation. Thus we write 

iff = e i[pA "- r - f]/fi w(r, t)ty P (r) = e*wx|/ P) (16) 

where the last form introduces abbreviations used in the subsequent 
analysis. The normalization condition on i|/ is 



1 =J \^\ 2 dF =J \^ P \ 2 \^\ 2 dV =J | 



r 



since we shall suppose that \\t P is normalized per unit volume and that w is 
slowly varying so that we may replace \i\fp\ 2 by its average value unity in 
the integrand. We shall suppose that w is a bell-shaped function, such as 
exp [— (r — r ) 2 /(na)% so that Vw = at r and the center of the packet is 

= (r\w\ 2 dV. (17) 

An alternative procedure would be to use, in place of w, a sum over i[fp; 
the two results would be equivalent, however, and the w method seems 
more convenient. 

We next operate on ty by 3C. Proceeding as for (12) and neglecting 
(Pao — Pa) 2 , we obtain: 

3ty = e" [^ [p 2 + 2(p A0 -p A ).p] + <m+ (-*>] wfp 

(a) (b) (c) 

w[3C + {-e)<p\if P + - (p A0 -p A )- (yiptyp + typpw) 

(d) (e) 

+ 1^w^p + ^^ 2 w]. (18) 

(The letters over the terms are for later reference.) 

At r = r , we again obtain !JC\|/ = [&(P) + ( — e)<p]ty except for term (e), 
which we shall show is negligible to a first approximation. Similarly the 
average value of t is again nearly v over the cell at r ; we shall show later 
that the whole packet itself also moves with the same velocity. 

Except for terms (b) and (d), "X\\> consists of slowly varying functions 
of r times i|/p. Terms (b) and (d), however, which contain pty P , have 
behavior in each unit cell quite different from ty P . They are, however, 



= e* [■ 
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still functions of the form exp [i(P ■ r)/h] times a periodic function of r. 
They can, therefore, be expanded in a series of wave functions ^;p(r) 
coming from other energy bands than does \|/p(r). If we suppose typ(r) 
comes from band "zero", and omit the subscript zero for simplicity so that 
tyop(j) = typ( r )> then we can write 

P^p = E (M0)i|r tf (19) 

where 

(i\p\0) = J ty iP *pty P dV (20) 

integrated over unit volume. The term for i = is 

(0|j>|0) = mv = mVpS(P) (21) 

in accordance with Section 15.4. We shall show that the contributions of 
t|/*P are negligible for electric and magnetic fields of ordinary magnitudes. 
We shall postpone the proof of this until the end of the section and in the 
intermediate steps will omit effects depending on v|f;p(r). 

Next we shall show that (c) and (e) are negligible compared to (b) and 
(d) respectively. Considering only the (0|^|0) term in (b) we have, for 
order of magnitude, 

(b) : (c) = wxl/pwy : wxlfp — = mv : h/na; (22) 

na 

mv for a thermal electron corresponds to a wave length of about 50 ang- 
stroms and hence to mv = h/SO angstroms. Thus if na > 50 angstroms, 
meaning that the wave-packet is many wave lengths in size, (c) -C (b). 
Precisely similar estimates apply to (d) and (e) so that (e) may be regarded 
as negligible. As in Section 15.5, the term (e) contributes to the spreading 
of the wave-packet. However, we shall neglect the spreading effects in 
this treatment. Thus we may write for the approximate value of X\|/: 

(a) (b) (d) 

UCiJf = e ie |wpC + (-*)##, + ~ (Pao - Pa) • P^p + - pw ■ pty P ~\- (23) 
L m m J 

We have left 3C as an operator so as to be able to use (23) later to estimate 
the contribution of the ij>ip terms. 

We shall next evaluate ihdty/dt. In this we need to consider two terms 
specially: 

Q t PAo(t) ■ r = -p A [r (t), t]-r = (f„ ■ V)p A ■ r + -p A ■ r 



= [Vo-V)Ai + ^Ai] -r (24) 
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where the derivatives in the last line are obtained by writing Pa as pA (t, t) 
and evaluating the result at r = r as suggested by the subscript. The 
other term is 

ih - \|/ P = ihP ■ Vp\ffp = ihP • 7pe i(p ' r)/K Up 
dt 

= -P ■ rty P + ihe i{p - r)/ *P ■ VpU P (r). (25) 

The last term has the periodicity of \|fp and can, like ptyp, be expanded 
in \|>ij>: 

e<w-'>"v,u,(r) = Z ('IVpIO)^-, (26) 

where, by a convention different from (20) above, we let 

(«|Vp|0) = J u iP *Vpa P dV. (27) 

These terms can be re-expressed 8 in terms of (/|i>|0); this is unnecessary 
for our purpose since rough qualitative features suffice. An upper limit 
for («|Vp]0) may be derived as follows: AsP varies from the center of the 
Brillouin to a point on the side, the wave function u^ will acquire approxi- 
mately one additional wave length (see Figure 5.7, for example). As a 
result Up will change by approximately 100 per cent while P varies by 
h/2a, the half-width of the Brillouin zone. Hence VpUp will be of the 
same order as (2a/h)xx P . This leads to the conclusion that (/|Vj>|0) will 
be no larger than about 2a/ h. Actually VpUp will not vary in the same 
way as either Up or Ui P so that the integral of the product may be much 
smaller than (2a/ h). This estimate is very crude but will suffice for our 
purposes. We can also conclude that (0|Vp|0) is a pure imaginary by 
showing that it is the negative of its complex conjugate. This conclusion 
follows directly from the fact that the wave functions are normalized for 
all values of P so that 

= V P fu P *UpdF = (0|Vp|0)* + (0|V/.|0). (28) 

We may now obtain ib(d\\t/d() in terms of (24), (25) and (26): 

(g) (h) 

ih j t i|r = e ie j/w - £ (f • V)Pa + j ( Pa + p] • rw+j. + M^r 

(k) 

+ ihP ■ £ (*|v,»|0)x|/ J. (29) 



8 A. H. Wilson, The Theory of Metals, Cambridge at the University Press, 1936, paragraph 
2.81. 
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) 


(a 2 ) 


w[S(P) + (-e) v ] + (r - r ) • [(-<?) V*> - V(j 


(go) (gl) 


= /w - 
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(r ■ V)A» + - Ai + P 
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(k) 
/ftP • (0|Vp|0)w. 


• [r w + (r 
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Again terms from other bands appear but with small coefficients. We 
shall neglect at first these terms and estimate them later. 

In order to find the changes in the parameters /, w, P, and r (which 
depends on w) of the wave-packet, we multiply 3Ci|/ = ihd-ty/dt by 
exp ( — id)\\i P * and integrate over a unit cell at r, considering all slowly 
varying functions to be constant and to have the values corresponding to 
r. The integrals of \ji P with i|/j/> are zero and with \j//> give simply the 
volume of the unit cell, which can be factored out. In this way we obtain 
from 5KT\|/ and from ihdty/dt, respectively, the left and right sides of the 
following equation: 

(b) (d) 

V(Ai • y)]oW + tVw- v 
i 

(g 2 ) (h)) 

- r )w] + ih — w 
y dt 

(30) 

The v terms arise from the pty P terms of (23) in accordance with (21). 
The (r — r ) terms arise from expanding the slowly varying functions <p, 
Pa, and r in Taylor's series about r = r ; all partial derivatives of these 
functions are thus obtained by expressing <p and Pa as functions of r and / 
and the values correspond tor = r , hence subscript zero. [See equation 
(24) for an example.] The antecedents in previous equations are in- 
dicated by the letters. 

All the symbols in equation (30), except i itself, stand for real quantities. 
This equation thus contains two sets of real terms: (I) those which 
simply multiply w by a constant — these are /, (ai), (g ) • (gi) and (k) ; 
(II) real terms proportional to r — r ; and (III) two imaginary terms 
involving derivatives of w. Equating set (I) to zero gives, symbolically, 

/ = (ai) + (go) • (gi) - (k). (31) 

[The contribution of (k) can be seen to be very small, corresponding to an 
energy of say 10 r 4 election volt, by arguments given at the end of this 
section. In any event (k) does not affect the charge density and hence 
the location or motion of the wave-packet since it enters \|/ only through 8.] 
The meaning of equation (31) can be seen most simply from !JC\|/ = 
ih(d\\i/dt); IKity was evaluated in (18) and it was shown that at r = r 
its value was &(P) + ( — e)<p(r ). The terms/, (g ) • (gi) and (k) from 
the right side of (30) combine to give essentially ih(di\i/dt). Thus choosing 
/ in accordance with equation (31) corresponds to choosing the correct 
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rate of change of complex phase angle in i|r so that the time derivative 
gives &(P) + ( — e)<p. Because (g ) • (gi) depends on the origin of co- 
ordinates, there is no important physical interpretation for/ itself. 
From set (III) we find 



— w = — v ■ Vw 
dt 



which means w = w(r — I vdt) (32) 



and indicates that the packet moves undistorted with velocity v. From 
this it is evident that 

r„ = v. (33) 

Actually the packet will spread owing to term (e) and to other small effects 
neglected in our approximations. 

Set II is satisfied for arbitrary values of r — r only if the vectors multi- 
plying r — r have a sum of zero. The resulting equation can be solved 
for?: 

(a 2 ) (b) (go) • (g 2 ) 

P = - {-e)V<P + V(Pa • v) - [(r \- V)p A + ~Pa^. 

Using to simplify (b) the relationships 

V(Pa ■ v) = (v ■ V)p A + v X (V X Pa) 

( — e) —e 

V XPa = V X A = — H, 

c c 

and for (g ) • (g 2 ) the equation 

r = v 

and the expressions (6a) and (5) 



{-e)E = (-,) 



\ dA~\ , N d 



dt 
we obtain the desired expression for P: 

P= (-e)lE + - c VXH 



(34) 



Still reserving for a moment consideration of the effect of the other energy 
bands, let us review what has been done. We have prepared a wave func- 
tion \|r which has the same charge density pattern, namely, |\}o»| 2 , near Tq 
as a wave function corresponding to state P of the Brillouin zone. At r 
this wave function satisfies, quite accurately, 3Cv|f = [&(P) + ( — e)<p\if 
corresponding to the energy of state P plus the added energy due to the 
potential <p; also it gives in the unit cell at r the same average velocity v 
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normally associated with P by v = VpS(P). This wave-packet then 
properly represents a localized wave function which corresponds to the 
extended wave function \|f/> in the absence of the fields. Under the in- 
fluence of 3Ci|/ = ihd^f/dt this wave function \|f changes. For convenience 
we have described this change in terms of the symbols P,f, and w which 
describe the wave-packet. The procedure of satisfying sets I, II, and III 
has been somewhat arbitrary. However, we have obtained a solution in 
this way and since the time dependence due to 3C\|; = ihd\\i/dt is unique, 
our solution for the rate of change of \j/ must be the only possible one. 
Because of the convenient form in which our solution is written, however, 
we can at once interpret it in terms of bodily motion and change in effective, 
P value. The results are in agreement with the interpretation of Chapter 7. 
We now return to the terms from the other energy bands. The wave- 
packet in terms of \J/j> does not constitute a solution of jK\}f = ihd^f/dt since 
terms from other bands enter through pty P in (b) and (d) and through Vj> 
in (k). We must, therefore, modify the wave form by adding a series of 
the form a{ty iP to \|/ P . We can evaluate the coefficients a t (which may 
depend on r) in this series by multiplying 3C\|r = ihdty/dt by exp ( — i8)ty iP * 
and integrating over the unit cell at r. The terms of set I do not cancel 
now because JCotyip = &i(P)tyi P for band ;'. This leads to a term of the 
form 

[&i(P) - S(P)] ai 

in the resulting equation. The other terms having a { as a coefficient all 
involve the size of the wave-packet or the fields and are small. The terms 
coming from the approximate solution \|/ P through (b), (d), and (k) are of 
the following orders of magnitude: 

(b) - (P M - p A ) ■ (i\ P \0) ~± e J^.* = nfUa; 

m m ca a 

(d) I(Mo).£-I*.A_JL 

m w m a na mn 

(e) KP ■ (*"| Vj.|0) ~heE-— = eEa/r. 

h 

The reader may verify, with the aid of Figure 10.2, that all of these energies 
are much smaller than S f (/») - &(P), which is a few electron volts, so 
that \ai\ is <C 1 and the contribution of other bands to the wave-packet 
is small. (Questions of convergence in the sum over i may be investigated 
with the aid of the techniques of the '^-sum" rule. See Seitz and Wilson, 
loc. cit.) The effect of term (e) has been discussed by Houston, and 
earlier by Zener 9 in a different form. It causes appreciable transition 

9 C. Zener, Proc. Roy. Soc. London 145A, S23-S29 (1934). 
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probability from one band to another when the voltage drop eE across one 
unit cell a is comparable to the energy gap &i(P) — &(P) where the bands 
are closest together. This leads to fields of about one volt per unit cell 
or 10 7 to 10 8 volts/cm. For smaller fields, the effect of other bands are 
shown by Houston to be negligible. We shall not repeat his rigorous 
treatment here but will be satisfied by the qualitative results obtained 
above. 

15.7 ON THE PAULI PRINCIPLE AND ANTISYMMETRIC 
WAVE FUNCTIONS 

We shall next consider in general terms the total wave function for M 
electrons occupying a certain set of quantum states having wave func- 
tions Xi ' ' ' X^* These wave functions will depend on the coordinates 
r = x, y, z of the electron and upon its spin quantum number s, which 
may take on either of the values +§ or —\. It is convenient to deal with 
wave functions which are eigenfunctions of s and to represent the spin 
wave functions as a(s) and P(j) where 

a(|) = P(-J) = 1, a(-i) = p(+i) = 0. 

The wave functions x are thus written symbolically in the form 

X«(?i) = <Mr,-)a(j,-) (!) 

where the symbol qi represents both coordinates and spin, and x« both 
the spatial and the spin wave function. In the next paragraph we shall 
consider certain general properties of the x's. 

If electron i occupies wave function x«> then the probability of finding 
it in volume dVi = dxidyidzi with a specified spin Si, a situation we refer 
to as dqt, is represented as 

IXnl 2 ^* = I^M^^VCfi). (2) 

The total probability regardless of spin value of finding electron i in dVi 
is denoted by p n (ji)dVi. It is obtained by summing (2) over the two 
values of S{, thus obtaining 

Pn(u)dVi = |<t> re (n)| 2 ^[a 2 (|) + a 2 (-|)] = |<Mr,-)|W. (3) 

The same result will be obtained for ^nO^PCfi) so that the particle density 
p n associated with x« is the same regardless of the spin function involved. 
In calculating average values, it is necessary to sum over the allowed values 
of j t - as well as integrating over dVi; this is represented symbolically by 

/ dqi. It is evident that, if the <(> A are an orthonormal set, then so are the 

Xn and, furthermore, that wave functions of opposite spin are orthogonal 
since a(j)p(j) is zero when summed over s. In the following section we 
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shall consider a particular set of wave functions <(> appropriate to dealing with 
a valence band with one electron missing. In this section, however, we shall 
consider only general attributes of the problem of putting M electrons 
into M wave functions Xi • ■ • \m- We shall, however, assume that the 
<j>™ and x™ are orthonormal sets, as is the case for the wave functions of an 
energy band. 

A wave function which represents one electron in each of the M states 1 is 

Xo = Xi(?i )X2(?2)X3(?3) • • • Xuiqia). (4) 

Other equally good wave functions would have electron (2) in xi an d 
(1) in X2 or other similar permutations, the most general of which can be 
represented as 

%o = Xi( 2 <7i)X2(=%2) • • • XM&qM) (5) 

where S is a "permutation operator" which rearranges the qi. For ex- 
ample Sg l = q 2 , Qq 2 = qi and SLqi = qi for / > 2 defines an operator 2. 
which interchanges electrons (1) and (2). There are M\ different ways of 
rearranging the M different qt's and there are consequently M\ distinct 
possible 2's (including 2 = 1, the identical permutation which has no 
effect) and thus M\ wave functions of the form 2xo- 

We next consider two closely related questions: (1) which 2xo or what 
linear combination of 2xo ' s correct to represent the M electrons occupy- 
ing the M wave functions? (2) How should the Pauli exclusion prin- 
ciple be expressed so as to rule out wave functions such as 

X2(tfl)X2(?2)X3(tf3) • • • Xm(Qm) 

for which two electrons use the same X2 ? Both of these questions are 
answered by the requirement that wave function for the M electrons must 
be such a function of the ^'s, denoted by A-(qi, q 2 , • ■ • , q n ), that, if any 
two electrons are interchanged, the result is equivalent to multiplying 
A by — 1; thus 

A(?2. qi, 93, q^ • ■ -,qM) = -A(?i, q 2 , q 3 , q<t, • • -,?m), 
or in general 

A(£q) = «(2)A( ? ) (6) 

where e(2) = ±1 depending on whether the permutation represented by 
2 can be produced by an even or an odd number of individual inter- 
changes; as is familiar in the theory of determinants, to which we shall 
shortly refer more specifically, a given permutation £ can be produced 
in many ways by the cumulative effect of a set of interchanges of indi- 

1 Such a wave function will be good eigenfunction of 3C only under special circumstances; 
see 15.8. However, this question does not concern us here since we are interested in the 
interpretation of such wave functions independently of whether they are eigenfunctions or 
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vidual pairs of the q/s. Furthermore, for all sets of interchanges in pairs 
which produce &, the number of interchanges is always even, or else always 
odd, depending upon S. Hence the S's can be classified as "even" or 
"odd" and «(S) denned as above. A wave function of the form just con- 
sidered is said to be antisymmetrical, hence the symbol A. 

In terms of these definitions it can be shown (the proof is left as an 
exercise to the reader) that the only wave function of form A which can 
be produced by a linear combination of Xo and the permuted 2.\o func- 
tions is 

A = - 7 Le*(£>)%o; (7) 

other equivalent wave functions can be obtained from this A by multiply- 
ing it by a constant, but no linearly independent A can be produced. The 
factor 1/VmI serves to normalize the wave function, as we shall show 
below. [See text following equation (13).] This form of A is identical 
with the determinantal form 

xi(?i) xatei) ••• x^tei) 
xite2) X2(«2) ■•• X^(?2) 



A = 



(8) 



Xiteitf) Xzteitf) ••• XMiqui) 

as may be verified by expanding the determinant in terms of its permuta- 
tions. From the determinantal form it is seen that, if two of the wave 
functions used in Xo are identical, two columns of the determinant will be 
equal and the determinant will vanish. This shows that the antisym- 
metric requirement on A excludes the possibility of the same wave function 
being used twice; this is the analytical equivalent of the simpler statement 
that no two electrons can occupy the same quantum state. 

We shall next calculate the density of electrons produced by A. This is 
done as follows. The probability that electron 1 has its coordinates in 
the range 

dq\ = dxxdyidzi and s\ = —\ or s\ = -\-\ 
— dVi and si = —J or si = +f 

and electron 2 has its coordinates in the range dq 2 , etc., is, according to the 
basic quantum-mechanical assumptions, 

|Atei, q 2 , ■ ■ ■ , qM)\ 2 dqidq 2 ■ ■ • dq M - (9) 

If the wave function is properly normalized, the integral over all the 
dqi should lead to unity. The total probability that electron (1) be in 
the range dq\, regardless of the positions, and spins of electrons 2, • ■ • , M, 
is obtained by integrating over dq%, etc. (that is, integrating over dV^ and 
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summing over j 2 ) 5 thus obtaining 

Pi(qi)dqi = dq x f ■■] \k\ 2 dq 2 • ■ • dq M . (10) 

The value of p x (q x ) is the probability of finding electron 1 in the range dq\. 
If multiplied by the charge on the electron and summed over s x , it gives the 
average charge density due to this electron. We can evaluate p x by using 
form (7) for A. We have 

Pifoi) = r^ f ■ ■ ■ f £«(3)2 X o*£«(903t X o*2 • • • dqM, (11) 

where the first sum extends over the M ! set of Si's and the second inde- 
pendently over the set of £R.'s. Now since all the functions in the product 

Xo = Xi(tfi)X2(?2) • • • Xiu(qM) 

are different and since they are assumed to be orthonormal, the only terms 
which integrate to a finite value are those for which 2 = 91; for all others 
the permutation of q's in Xo * will be different from those in Xo and con- 
sequently at least one integration of the form X»»* (?»')X« (?»')^9 r » = will 
occur. Thus (11) reduces to a single sum over 2: 



Pi 



(<7i) =^T f-f %o*%^2 • • ■ dq M . (12) 



In the sum there will be (M — 1)! terms in which X i*(0i) X i(0i) occurs 
while 02 ■ ■ ' <7m are permuted through X2 • • • \m- Each of the M — 1 

integrations for any one of these terms gives a factor / Xre *(0i) Xn (g , i)^Q , i 

= 1, so that in the summation the term X i* (?i ) X i (q x ) is obtained (M — 1 ) ! 
times over. The same result will be obtained for every X n*(?i) X n(?i) ) so 
that we obtain finally 

Pifoi) = {M ~! )] ZXn*(qi)Xn(qi) = ^Z hn(qi)\ 2 . (13) 

If we now integrate pi {q x )dq x , the value is unity since the X „ are supposed to 
be normalized. (This shows that A is properly normalized.') We can then 
interpret p x as the probability density for electron 1, its value being simply 
the average for all the individual charge densities in the set of M wave 
functions x> From the symmetry of A and of the result for pi(qi), it is 
evident that the same density will be obtained from every electron lead- 
ing to a total average charge density Pr(<7i): 

Pr{qi)dqi = !L\xn(ai)\ 2 dq u (14) 
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where pt(Qi)^Qi is the average number of electrons found in dq\\ that is, 
it is the probability of finding electron 1 in dq\ plus similar probabilities for 
all the other electrons. The net charge density is thus just the same as if 
Xo itself, rather than A, had been used so that the Pauli principle does not 
affect the additivity of the charge density. 

A precisely similar treatment can be carried out for the current operator 
corresponding to each electron. The same orthogonality conditions will 
eliminate terms with mixed x«* an d Xm with n ^ m and the end result is 
that the average value of the current due to the M electrons is simply the 
sum of M currents corresponding to the M wave functions xi to X-^- 

We must next show that if the time dependence of each individual x* 
is obtained by requiring it to satisfy the equation 

3CiX<(«i»') = ihj ( Xi(qut) (15) 

where 3Ci is the Hamiltonian operator for the coordinates and momenta of 
electron 1 in the presence of the applied field, then the time dependence 
of A is properly given by expressing A in terms of the time-dependent x's. 
The equation which A must satisfy is 

DC r A = pCi + W 2 + ■ • ■ + 3Cjf)A = ih-A. (16) 

at 

We shall show that, if A is expressed in form (7), each individual term 
satisfies this equation and hence that A does. It suffices thus to investi- 
gate Xo- We have 

3CrXo = [3CiXi(?i')]X2(?2, t) ■ ■ ■ xm(Qm, t) 

+ XiiQi, ')P(2X2(02, /)]•■■ XM{qu, t) 

+ X.i (?i> ')X2(02, t) ■ ■ ■ rJCjuXMiqM, /)] 
= it>-xi(qu t)x2{q2, t) • ■ • Xm(<1m, t) 

+ i^xiiqi, t)\2 (q 2 , t) ■ ■ ■ Xm(Qm, t) 



+ ihxdqi, ')X2 (?2> /) • • • Xm(Qm, t) 

which proves the desired result. 

The Fock Equations. The determinantal wave functions are only 
approximate solutions for wave equations in which electronic interactions 
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are included. The problem of finding the best set of x's f° r a given 
Hamiltonian has been considered by Fock 2 and Slater 3 and the resulting 
equations which the individual one-electron x's must satisfy are called 
the Fock equations. In these equations each electron is acted upon by 
the average potential of the other electrons, but in taking the average the 
influence of the Pauli exclusion principle is included. The Fock equations 
are treated in texts by Frenkel 4 and by Seitz; 5 the latter author makes 
special application to the problems treated in this appendix and uses 
Koopmans' theorem 6 to show that changes in the energy of the system are 
given by changes in &(P) for the individual wave functions. 

15.8 THE CONSTRUCTION AND BEHAVIOR OF A HOLE 

WAVE-PACKET 

In this section we shall deal with a valence band which is missing one 
electron so that in effect we shall study the properties of a hole. We shall, 
for simplicity, suppose that the missing wave function comes entirely from 
the states with one value of the spin so that the states with the other value 
of the spin are fully occupied. If the band of energy levels is completely 
filled, then all of the allowed quantum states will be occupied. Conse- 
quently, the application of electric or magnetic fields cannot change the 
way in which the electrons occupy the quantum states since it is required 
by the Pauli exclusion principle to use all the states to accommodate all 
the electrons. As a consequence, the electrons in a filled band are not 
strongly influenced by electric or magnetic fields but instead contribute 
only to the dielectric constant and the diamagnetic susceptibility. How- 
ever, if there is a hole in the valence band, currents will be produced by 
electric fields which will be modified by magnetic fields in just the way 
expected for a positive charge. This is the theorem which we shall prove 
on the basis of equation (1) of Section 15.6 and the Pauli principle. 

In this treatment we shall suppose that the spatial (not spin) parts of 
the wave functions of the electrons in the valence band all satisfy the same 
Schroedinger equation 

3Co * a = \jtn p2 + ^H * a(r) = 6a * a(r) - (1) 

If periodic boundary conditions are employed, a suitable set of i|f a are the 
N wave functions y\/p where N is the number of unit cells in the crystal. 
For other boundary conditions also, there will be N wave functions covering 

2 V. Fock, Z. Physik 61, 126-148 (1930). 

3 J. C. Slater, Phys. Rev. 35, 210-211 (1930). 

4 Wave Mechanics, Advanced General Theory, Oxford, 1934. 

6 Modern Theory of Solids, McGraw-Hill Book Co., New York, 1940. 
6 T. Koopmans, Physica 1, 104-113 (1933). 
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the same band of energies. Since there are severe conceptual difficulties, 
however, in combining periodic boundary conditions and magnetic fields, 1 
we shall consider a rectangular block of crystal outside of which the wave 
functions must vanish. We shall refer to this set of wave functions as \\f a . 

We first form a wave-packet function fa by superposition of the wave 
functions i\r a . From the completeness of the set of wave functions \|/«, it 
follows that they can be used to produce as well defined a wave-packet for 
an electron in the valence band as the wave functions ty P used in Sec- 
tion 15.6. However, when the wave-packet formed with the \|f a reaches 
the boundary of the crystal, it will be reflected since the \|r a functions vanish 
outside the crystal; whereas if it were formed from the i|/p with periodic 
boundary conditions, as it impinged on one face it would re-enter the 
opposite face. We shall suppose that we deal with wave-packets cor- 
responding to a hole near the center of the crystal so that these matters 
need not concern us. 

We consider, therefore, a wave-packet fa (r, /) which is composed of a 
sum of the \ff a : 

N 
4>l( r >') = £ *la{Wa(T). (2) 

We suppose that <j>i is normalized. Under the influence of electric and 
magnetic fields, the wave function <j>x will transform so that the wave- 
packet is accelerated in accordance with equations (1) and (4) of Sec- 
tion 15.6. In the following equations, we shall use the symbol <j>! to 
represent a time-dependent function which satisfies the wave equation 

VCfa = ih-fa (3) 

where 3C includes terms due to the applied fields. 

We next construct N - 1 other wave functions, fa(r, /) which are also 
solutions of (3) and together with fa form an orthonormal set. It is 
possible to do this because at any particular time, say / = 0, we can always 
construct an orthonormal set from fa and any N - 1 functions \\t a which 
together with <j>j. form a linearly independent set of N functions. 2 These 
other fas may be written as 

iv 

<l>n = L ««a(/)l|f«(r) (4) 

a =1 

where the dependence of a na on / makes them solutions of (3). Further- 

1 Probably closely related to the question of the existence of magnetic poles. 

2 See, for example, E. Madelung, Die Mathematischen Hilfsmittel des Physikers, p. 24, Dover 
Publications, New York, 1943, and Courant-Hilbert, Methoden der Mathematische Physik, 
Kap. II, par. 1, Dover Publications, New York, 1947, and the discussion in connection with 
formula (16) of Section 14.5. 



J 

dt 
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more, if the set of other functions <)>„ and <}>i are orthonormal at t = and 
all satisfy the same equation (3), then they are orthonormal at all later 
times. This follows from the Hermitian property of 3C as follows: 

f$n*$ m dr = f [Q 3C4>„Y <|> m + <t>„* I 3C<t> m ] dv 

= kf [ ~ (3C<W **" + ♦»* DC *»^ = °> (5) 

which shows that the integral is constant so that the orthonormal relation- 
ship is preserved. 

Charge Density and Current Density for a Hole Wave-packet. We shall 
next use the IN wave functions, consisting of <j>i(r)a(j), • • • , $.y(r)a(j) for 
one spin and <t>i(»")P(-0, • • • , ^ivOOPCO for the opposite spin in determi- 
nantal wave functions in accordance with the principles of Section 15.7. 
In this analysis, <t> 1} which represents the missing electron and corresponds 
to the hole, is imagined to be made from wave functions near the top of the 
valence band for which the energy decreases with increasing P as discussed 
in connection with equation (4) of Section 7.6 in the text. This feature is 
essential in determining the behavior of the hole. 

We consider first the wave function for the filled band. The antisym- 
metrical wave function for this case is a determinant using all the <|>'s. 
We shall denote it by A/ (<(>), the/, for "full", meaning that all the <|>'s are 
used so that the band is full. However, since the <|>'s are simply linear 
combinations of the \|/ a 's, this determinant has columns which are linear 
combinations of the columns of A/(v|/) which is made by using all the \}f's. 
Since a determinant is unaltered by adding to one column linear combina- 
tions of other columns, A/(<j>) and A/(\j/) represent the same wave function 
except for a multiplying factor with absolute value unity. 

Now under the influence of electric fields, A/(\|/) will make contributions 
to the dielectric and diamagnetic susceptibility but will not contribute any 
conduction current. Since A/(<|>) is the same wave function, it will, of 
course, vary in time in the same way and produce the same effects as A/(\|/). 

Next we consider A p (<|>) which is made up of all the wave functions 
except <t>ia. It represents a charge density p v . According to equation 
(14) of 15.7, p p is equal to the sum of the charge densities for all the wave 
functions it contains. It is, therefore, related to the electron (not charge) 
density p/ of A/(<j>) by the equation 

P P (r) + Pi(r) = Pf (r), Pl (r) = \^(r)\ 2 (6) 

since the sum in p p (r) differs from p/(r) only in having j<|>i(r)| 2 = pi(r) 
missing. Similarly the particle (not electric) currents due to Aj,(<(>) will 
differ from those of A/(<j>) only by the current of the wave-packet <|>i. If 
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we let /represent the particle current (electrons/cm 2 sec), then the electric 
charge and current densities will be 

— ep p — epi = -ep f or -ep p = -ep f + ep x (7) 

— eJ p — eJ\ — —eJf or — eJ p = — eJf + eJ\. (8) 

Thus the added hole will add the charge and current corresponding to a 
charge -\-e moving with the wave-packet motion of <j>i. 

In order to prove that this behavior corresponds to a plus charge and a 
plus mass, we must show that the acceleration of the particle is given by the 
appropriate formula. Since the wave-packet is made from wave functions 
near the maximum energy in the band, the value of 6 (P) will be given by 
equation (4) of Section 7.6 and will lead to equation (5) of Section 7.6 or 

v = -P/m, (9) 

where m is a positive quantity. Inserting this in the equation for P 
derived as (34) of Section 14.6 leads to 

mi, = -p = + e [& + „ x H/c] (10) 

which is appropriate to a positive charge. 

Thus if the wave-packet function for the electron missing from the band 
comes from the top of the band, so that the minus sign occurs in the rela- 
tionship [equation (9) above] between v and P, then the motion of the 
remaining electrons is such that the currents and charge densities they 
produce are the same as if a wave-packet of plus charge and plus mass 
were moving under the influence of the fields. 

These results can be given a more general expression in terms of the 
momentum and energy of all the electrons in the energy band. We repre- 
sent by P n and P p , & c (P n ) and & V (P P ), and v n = Vp n S c (/ > „) and 
v p ~ Vp P &v(Pp) the values respectively for an excess electron and a hole 
of the momentum, the energy and the group velocity for an appropriate 
wave-packet wave function. The energy & B and momentum P B for the 
entire energy band are 6> B = &o + S C (P„) andPs = P n for an electron and 
Sb = S — & v (Pp) and P B = — P P for a hole. In terms of these defini- 
tions, the equations for packet velocity and acceleration may be expressed 
in a symmetrical way in terms of 6> B and P B as follows : 

Excess Electron Hole 

PB = Pn P B = -P p 

(a) 
Ss = So + &c(Pb) &b = &o - & v (-Pb) = So - &v(P B ) 

(b) 
V Pn & c (Pn) = V Pb It9b(Pb) v p = Vp p S„(P p ) = Vf b S b (P b ) (11) 



-P n = -e\& + -v n XH\ Pb= -P p =+e\& +-v p XH\ 



(12) 
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where equality (a) holds because £„(?) = & V (-P) for any energy band 
and (b) follows because the same fact leads to v(P) = — v(—P). The 
last two rows show that, in terms of the energy and momentum of all the 
electrons, the group velocity and Pb are given by formally identical equa- 
tions except for the sign of the charge. Both velocities correspond to a 
positive mass since &b(Pb) increases in both cases as Pb varies from the 
point of minimum energy for 6>b, this point corresponding to the particle 
at rest. 

Hole Wave-packets in Terms of Energy Eigenf unctions. The wave func- 
tion A p (<}>) for the hole wave-packet is not an eigenfunction for the Hamil- 
tonian for the 22V — 1 electrons in the valence band. However, it is pos- 
sible to construct A p (§) by superposition of a set of eigenfunctions for the 
2/V — 1 electrons ; we shall show how this is done, the procedure and the 
result both being instructive in regard to a number of the properties of the 
holes. 

The wave functions \|/ a , which are superimposed to form <(>i according to 
equation (2), which we repeat for reference here, 

JV 
a — 1 

are eigenfunctions of the one electron Hamiltonian operator 3C with 
eigenvalues t& a [see equation (1)]. From this it follows that an antisym- 
metric wave function made of the \J/ a is an eigenfunction for 2N — 1 
electrons with an eigenvalue S = £]§« summed over the i\i a used in the 
determinant; the proof of this statement is readily obtained by the 
procedures used in equation (17) of Section 15.7 in which the time de- 
pendence of a determinantal wave function was studied. Such deter- 
minantal wave functions can be conveniently designated by the quantum 
number of the missing \\rp, and we shall let M^ (\[r) represent the wave 
function of the 27V — 1 electrons; we shall later in equation (13) give a 
rule which specifies the complex phase of M^. Compared to the energy of 
a full valence band, the energy of M|}(\|/) is evidently — &$. 

Our problem is now to find an expression for A p (<j>), which has <j>i missing, 
in terms of M^xji). The solution, as we shall verify below, is 

A„(40 = £«ia*M«(+) (13) 

a 

where the a ia * are complex conjugates of the coefficients in (2). 

This result is obtained by expanding the wave function for the full band 
in two ways and equating the resulting expressions. If the phases of A/((j>) 
and A/ (\|f ) are properly chosen, we may write 

A/010 = A/flO. (14) 
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Now both of these wave functions stand for a determinantal wave function 
of 2N electrons with the coordinates of electron 1 occurring across the top 
row [see equation (8) of 15.7]. They can then be expanded in products of 
the first row functions and their minors. The minors of A/ (\|f) are evi- 
dently just the functions M a (\|f) expressed as functions of the coordinates 
and spins of electrons 2, 3, • • • 2N. Thus we may write 

A/ty) = EtM?i)M a 0|f; 92, ■ ■ ■ , Q2n). (15) 

a 

This serves to specify the phases of the M a functions. Similarly 

A/(<|>) = E<J>.(?i)M,(<|>;?2 • • • ?2iv) (16) 

8 

where Mi(<|>, q 2 • • • q 2 N) is evidently, except for a phase factor, Aj, (<}>)• 
In order to evaluate Mi(<|>) in terms of M a (v|/), we equate (16) to (15), 
multiply by §i*(qi) and integrate over qi (which includes summing over Si). 
Since the <j>'s are an orthonormal set, we obtain 

Mi(<l>;02 • • ■ ?2Jv) = ZZJ ^i*(qi)^a(qi)dqiM a (y\f;q 2 ■ ■ ■ q 2N ). (17) 

The coefficient of M a is readily found from (2) by multiplying by \|; a *, 
integrating, and taking the complex conjugate. This leads to the value 
ai a * for the coefficients in (17), a result recognized as equivalent to equa- 
tion (13) which was to be proved. 

It is evident that, if the tb a for the expansion functions in (13) are not 
equal, then the M a 's correspond to different energies, so that, as stated 
earlier, A p (<|>) will not be an energy eigenfunction. 

The method of expanding A p (<j>) in terms of the M« affords a way of 
dealing with holes without the one electron approximation introduced in 
equation (1) of this section. For a band with one missing electron there 
will be N — 1 eigenfunction solutions which are actually exact solutions 
and not approximations like the determinantal functions dealt with in this 
section and the previous one. When these exact solutions are combined 
according to (13), an exact solution representing a localized deficit of 
negative charge will be obtained. Dealing with these exact solutions will 
then lead to results like (11) and (12) in which v will be defined as 
Vp s S s (P g ), where P s and S S (P S ) describe attributes of the exact solutions 
for the .System. From considerations of this sort, the behavior of a hole 
can be obtained purely from the characteristics of a band missing one 
electron without making use of one electron wave-function approximations. 
In other words, although the treatment here has been based on approxima- 
tions, the fundamental attributes of a hole are more basic and would be 
obtained in an exact treatment as well. 
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15.9 PERIODIC BOUNDARY CONDITIONS AND 
ELECTRIC FIELDS 

To a considerable degree, this section duplicates the results of 15.6. 
There are, however, important conceptual differences. In Chapter 6 we 
discussed the currents due to the Bloch wave functions, which represent 
uniform charge density in the crystal. The calculation of conductivity is 
often carried out on the basis of such a picture; however, the procedure 
used usually appears to be internally inconsistent. The inconsistency 
arises from the fact that the wave functions which initially satisfied the 
periodic boundary conditions change their P values uniformly under the 
influence of an electric field so that they no longer satisfy periodic boundary 
conditions. This apparent inconsistency is eliminated by a detailed con- 
sideration of the way in which the wave functions vary under the influ- 
ence of a field when a model having truly periodic boundary conditions is 
employed. 

The difference between this section and 15.6 is thus that we here deal 
with extended wave functions each corresponding to a single value of P, 
with periodic boundary conditions and with electric fields only. To 
extend the method to include magnetic fields appears to offer considerable 
difficulties and we do not attempt it. We shall show that if the conditions 
considered in 15.6 are made to correspond to those considered here, the 
behavior of the wave functions become the same. 

We shall consider in this section a model, similar to that of Figure 5.6 
or 5.9, which physically has periodic boundary conditions in one direction 
and "vanishing" boundary conditions in the other two dimensions. An 
electric field in the direction of the periodic boundary conditions will be 
applied and the resulting behavior of the wave functions will be shown to be 
consistent with the description of Section 7.2 of shifting values of P v , 
the component of crystal momentum parallel to the field. 

We imagine that the crystal is in the form of a thin rectangular sheet 
which is bent to form a hollow cylinder. The periodic boundary condition 
applies in the 0-direction of a cylindrical coordinate system. The electric 
field in this direction is applied by making the cylinder surround an elon- 
gated concentric transformer core, so that the cylinder forms a single 
short circuited turn. If the thickness of the sheet is made small enough 
compared to the radius of the cylinder, the bend can take place elastically 
so that the atomic arrangement is perfect except for a curvature of the 
lattice. For a sufficiently large radius r compared to the thickness Sr 
of the crystalline sheet, the wave equation can be made to differ by a 
negligible amount from that for a flat sheet with rectangular coordinates 

x = r — r , y = r 8 <£, z = z. (1) 
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We shall consider below the limiting case as 5ro/r — >0 so that set (1) 
become effectively rectangular coordinates. 

We shall consider an electric field which varies slowly so that retardation 
effects can be neglected. If the flux in the core is represented by 

JB z dA z = -c(v(t)dt, (2) 

where V(t) is the potential difference which would appear across an open- 
circuited secondary, then the field outside the core may be represented by 
the vector potential alone and the vector potential and fields are 

cfnodt 

A = i+^— ,E=--A = U—-^-,H = 7XA = 0, (3) 

lirr c lirr 

where i v is a unit vector in the <p direction. 

The kinetic energy operator in the presence of the vector potential is 

2^ (P 2 -PPa- Pa ■ P+Pa) 2 = ^(P 2 ~ 2Pa-P+Pa) 2 (4) 

where px = { — e/c)A is the proper term to introduce for a particle with 
charge —e. Since V • A = 0, it follows that p ■ Pa — Pa- P- In cylin- 
drical coordinates (4) becomes 



1m 



2 (#_ 1 1 1 #_ d_\ 

1 ] \dr 2 r dr + r 2 d<t> 2 + dz 2 ) 



- 2 - + pA 2 (r, t) 

lirr ir a<j> 



(5) 



where p has been expressed as (ft/*)V. If we substitute the variables 
x, y, z of (1) into this expression, it becomes 



1m 



+ 1m 



( d 2 . f_ £\_J-')f y W'ia 

' dy 2 + dz 2 ) 
1(-e) fv{t)dt 



■ }il [f? + a> + i) - 2 L § + ^ t] 



h 2 d 



r dx lirro 



(M'HM^H-* 6 ' 



The second line represents the correction term due to the finite radius of 
curvature. In it the term (h 2 /r)d/dx stands approximately in the ratio 
a/ro to h 2 d 2 /dx 2 of the first line where a is the lattice constant. The last 
two terms can be made negligible by making the sheet thin so that in it 
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r — r < r . Thus the kinetic energy operator is well represented by 

7T t^ 2 - 2 P**Pv + P*& (7) 

2m 

where p is now (h/i)V in respect to the quasi cartesian coordinates xyz and 

pAy = ± = (-«)/ -£,(/)* where £„(/) = - V{t)/2*r Q . (8) 

The quantity pAy is seen from (3) to be one component of p A of 15.6 
equation (5). For the elastically bent lattice we may take 6 U as a peri- 
odic function ^(tf, j, z) = ^{x + n x a, y + n„a, z + n z a) of the coordi- 
nates x, y, z where a is the lattice constant and the n's are integers. With 
this potential, the wave equation for y\r(x, y, z, t) becomes 

3C+ = [^ (P 2 ~ 2p Ay p v + p A y 2 ) + %] + = «+. (9) 

A good solution to this equation can be obtained from the solutions 
for pAy = 0. These solutions correspond to a periodic boundary con- 
dition in the _y-direction and vanishing boundary conditions in the x and 
z directions. For any fixed values of the x and z quantum numbers, which 
correspond to \P X \ and \P*\ in this case, we shall have solutions for values 
of P y satisfying the periodic boundary conditions. These solutions can 
be written in the form 

^ Py = j p >*i*u Py {x,y,z) ( 10 ) 

where we have omitted |P X ||P Z | which remain constant in the subsequent 
equations. Since \|fp„ is an eigenfunction of 3C , the Hamiltonian operator 
for pA V = 0, we have 

3Co*p v = e**"* [^ (/ + 2P*P* + Py 2 ) + ^] Up, 

= 6(P y )i|*, = eW'&iPJup,. (11) 

The approximate solution to (9) which reduces to \|/p„ when pA V = 0, is 

^ = e'^/'V.-M, exp Js(P y - p A y)dt/ih 

s e i(P„w*)+« Ui>s _ p4v (12) 

where the integral in the exponential is represented by 6 and P y is a con- 
stant independent of /. We shall show that (12) is an approximate solu- 
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tion of 3fC\|/ = ihd-ty/dt by evaluating the two sides of the equation. 
3C\|/ is 

X [^ (P 2 + IPyPy + Py 2 ~ 2p AyPy ~ IpAyPy+pAy 2 ) + ^J Up B - 



VAv 



« P V*IV +iS [^ [p 2 + 2(P y - PAy )P y + (Py -p A y) 2 ] + Gll]up„ 



= e «^/»)+« 5(P„ - ^up^, (13) 

and ?^3^/d^ is 

ihjfl = ^'^[-iKpAy-^UPv-M + &(Py - pAy)*Pl- PA \\. (14) 

Thus the approximate solution satisfies the wave equation except for the 
term in du/dP y . We have discussed a term of this sort in Section 15.6. 
As was pointed out there, this term introduces negligible corrections for 
electric fields of reasonable magnitudes. We may, therefore, regard (12) 
as a good solution and consider its meaning. 

Although the solution maintains the same value of P v in the exponential 
term, so that its complex phase runs through the same number of cycles 
in the length of the crystal and thus continuously satisfies the periodi.c 
boundary conditions, the behavior of the energy and velocity associated 
with the wave function varies just as if P y were increased at a rate — eE y 
as discussed in Section 7.2. This is seen for the energy directly from equa- 
tion (13) which shows that 

[y*Xtydx = &(P y - pAy) = 6(P„ - ejEydt). (15) 

By applying the group velocity formula for v v to this energy we obtain 

"y = 7^~&( p v- Pav) = "y(Py - pAy) (16) 

df y 

where v y (P y ) = d&(P y )/dP y is the velocity in the absence of p A - [The 
same result may be obtained by using the operator (p y — pAy)/tn for 
velocity [see Section 15.6, equation (7)] when a vector potential is present.] 
Thus although the change of phase of the wave function from one cell to 
the next is not altered by the field, the change in energy and velocity is 
just the same as if P y varied at a rate —pAy = —eEy so that the descrip- 
tion given in 7.2, although not analytically correct, leads to the correct 
results. 
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Another way of viewing the effect of the field is to say that the quantum 
number P y remains constant but that the energy and velocity are different 
functions of P y at different times, the variation in these functions corre- 
sponding to shifting the energy surface and velocity diagram at a rate 
-\-eE y with respect to the P y values. 

We may now compare the wave function (12) with the wave-packet of 
15.6. If the electric field is uniform and is produced through the vector 
potential only, the equation for P of 15.7 can be readily integrated. We 
have 



or 



P= (-e)(-- c A\+P =Po~Pa 



(17) 



(18) 



where Po is an integration constant. The spatial part of the wave-packet 
then becomes [in accordance with equation (11) for \|/ and (34) for P in 
Section 15.6] 

xfr = w [exp (ip A ■ r/h)] \\> Pl) _ PA (19) 

(since we assume E is uniform,^ is independent of position and£«io — Pa) 
and this may be rewritten as 

\|/ = w [exp (ip A ■ r/h)]{exp [i(P -Pa)- r/h]}\x P(r . PA 

= w[exp(tfVrA)]u Po _, A , (20) 

a form equivalent to (12) of this section. Thus although P varies in 15.6, 
if the field arises from A only, the complex phase factor in front of the 
periodic u does not change. If, however, electric fields due to <j> are 
present, the complex phase factor will change. For the periodic boundary 
conditions used in this section fields due to <f> are excluded since <t> must 
be single-valued. For the treatment of Section 15.6, the change from 
constant Po to varying Po may be brought about by changing <f>, A and iff 
by a gauge transformation; and the gauge transformation leaves the 
physical condition described by the wave function unaltered. To sum 
up, for a wave-packet in a uniform electric field the behavior of the spatial 
part of the wave function is the same as that of the extended wave function 
of this section, except for the factor w. 

Finally it may be mentioned that one can think of a model of the sort 
considered here for which periodicity in 2 as well as y is achieved. This 
can be accomplished by continuing the structure in the z-direction for a 
distance z much greater than Tq. It then becomes a coaxial structure 
with a central transformer core and a cylindrical sheet of conductor. 
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This structure can then be bent into a toroidal structure so that the metal 
becomes a toroidal surface. If z >■ r , the local effect of curvature can 
again be neglected. The core inside the metal surface will produce an 
electric field in the jy-direction as before. A new field in the z-direction 
can be produced by changing the flux passing through the hole in the 
toroid itself. 



CHAPTER 16 

STATISTICAL MECHANICS FOR SEMICONDUCTORS 

16.1 DERIVATION OF THE DISTRIBUTION LAWS FOR 
SIMPLE QUANTIZED SYSTEMS 

16.1a. Basic Concepts. In this section we shall present a brief review of 
the application of statistical mechanics to quantized systems. The formulae 
will be derived for the Planck energy distribution for harmonic oscillators 
and for the Fermi-Dirac distribution. The relationship of statistical 
mechanics to thermodynamics will be touched on briefly. (A treatment 
of the relationship is presented in the Problems.) 

In the statistical mechanics of a quantized system the concepts of the 
system, its quantum states, distributions, and a priori probability play a 
major role. We shall consider these in turn below: 

By a system we mean a certain number of electrons and nuclei subject to 
certain boundary conditions. For example, we might have 4.52 X 10 
germanium atoms contained in a volume of 1 cm 3 with rigid, impenetrable 
walls. Usually it is a good approximation to consider the behavior of only 
part of some real system and to treat this part as if it were itself an entire 
system. One example of this sort is the 20 electrons and 36 quantum 
states considered in Section 10.2. As another example, the system may 
consist of a number Q of harmonic oscillators. The system of normal 
modes of atomic vibration for a crystal discussed in Section 17.3 is an 
example of this sort. A semiconductor contains parts of both types com- 
bined into a common system. 

The all-important attribute of the system for statistical purposes is the 
energy scheme of its quantum states — in other words, the complete list 
of its quantum states and their energies. This list may also be referred to 
as the energy level scheme of the system, provided that proper allowance is 
made for the fact that several quantum states may belong to the same 
energy level. It is seldom possible to deal with an exact energy level 
scheme and it is usual to consider the energy of the system as made up of 
the energies of various parts, each of which is in its own quantum state. 
Approximations of this sort are frequently made and, as discussed in 
Chapter 13, furnish the best insight into the problems involved. The 
separate statistical treatment of electrons and lattice vibrations in Chapter 
17 permits the neglected interactions to be interpreted as causing transi- 
tions. In fact, it is difficult to see how to formulate the theory of re- 
sistance in terms of the exact quantum states of the entire system and it is 

452 



16.1a] A PRIORI PROBABILITY AND DISTRIBUTIONS 453 

almost certain that the interpretation would be much more laborious. 
(Such an exact procedure may be necessary to explain superconductivity, 
however. 1 ) Other examples of breaking the system into separate parts 
which are approximately independent are discussed in Section 5.2 and at 
the end of Section 15.7. 

The approximation made in taking the quantum states of the entire 
system as being composed of the quantum states of the subsystems in- 
troduces a negligible error if the neglected interactions of the subsystems 
are not large. This result, which we shall accept as an additional assump- 
tion here, can be established with the aid of quantum-mechanical theorems 
relating to the invariance of the trace of a Hermitian matrix under a unitary 
transformation of the wave functions. 

Once the system is considered to be broken into a set of subsystems, the 
concept of a distribution has meaning. We shall illustrate this concept 
first for the case of electrons governed by the Pauli principle (Sections 5.2 
and 15.7 and Chapters 9 and 10). The energy of the system may be 
described as the sum of the energies of the quantum states occupied by the 
electrons. We classify the quantum states according to energy and divide 
them into groups of approximately equal energy. In terms of this classi- 
fication, a distribution is specified by giving the number of occupied states 
in each group. To a given distribution there will usually correspond many 
quantum states for the system. For example, if one of the groups of states 
contains two states and the distribution specifies that only one is occupied, 
there will be two ways of choosing the occupied state. In general there 
will be more choices than this for each group so that there will be a very 
large number of ways of achieving the distribution, each way corresponding 
to a particular quantum state for the system. 

The central problem of statistical mechanics consists of finding the dis- 
tribution which can be achieved in the largest number of ways, for a given 
system with a given energy. The importance of this distribution is that 
it represents the properties of the system under equilibrium conditions. 
Several steps are necessary to show that the distribution having the most 
ways represents thermal equilibrium. The first step involves the introduc- 
tion of the assumption of equal a priori probabilities for each quantum 
state of the system. According to this assumption, if a system is set up in 
the laboratory so that it has a certain amount of energy and is next isolated 
so that it remains in a particular quantum state, then it is equally likely 
to be in any one of the possible quantum states of this energy; in other 
words all quantum states for the system have equal a priori probability, 
provided that they have the right energy. 

The assumption of equal a priori probability seems reasonable on its 

1 The recent theories of superconductivity of H. Frohlich [Phys. Rev., late (1950)] and 
J. Bardeen [Phys. Rev., 79, 167 (1950)] are concerned with showing that the lowest states of 
the combined electronic and atomic system can be superconducting under suitable conditions. 
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face value. It can, however, be made to seem still more plausible on the 
basis of the quantum-mechanical H-theorem. The proof of the H-theorem 
and other methods of examination of the foundations of the statistical 
mechanics of quantized systems represent advanced and difficult topics 
and will not be pursued further here. 

The possible quantum states for a given energy can be grouped, of course, 
into various distributions; and no quantum state will occur in more than 
one distribution. Among all the possible distributions there will be one 
which has more quantum states than any other. There will be other 
distributions which differ slightly from this one and can be achieved in 
approximately the same number of ways. For systems with large numbers 
of particles it can be shown that only the most probable distribution need 
be considered. The reason is that distributions which differ appreciably 
from the most probable distribution contain very many less quantum 
states. In fact, if all these differing distributions are combined, the total 
number of quantum states in them is negligible compared to the number in 
the most probable distribution. Hence there is a negligible probability 
of finding the system in any state except one which belongs to the most 
probable distribution, or to distributions very similar to the most probable 
distribution. 

A helpful means of thinking of the way in which the most probable dis- 
tribution arises is furnished by the concept of the microcanonical ensemble. 
For this purpose we visualize a very large number of identical systems, one 
in each of the quantum states having the prescribed total energy. In 
order to avoid difficulties arising from the discrete nature of the energy 
levels, the prescribed energy is usually represented as a small range from 
6>i to & 2 where the interval S> 2 — &i is a very small fraction of Si. This 
large number or ensemble of systems is the microcanonical ensemble and of 
this large number all but a negligible fraction are in quantum states be- 
longing to the most probable distribution. The observed value for any 
property of an actual system is, on the average, that obtained by averaging 
the property of interest over all the systems of the microcanonical ensemble, 
this conclusion being a direct consequence of the assumption of equal a 
priori probabilities. Because of the overwhelming importance of the most 
probable distribution, the average over the microcanonical ensemble of 
any quantity of practical interest is the same as that obtained for the most* 
probable distribution. Deviations from the value for the most probable 
distribution do, however, become significant for a small system, such as an 
individual colloidal particle, and can be calculated with the aid of the theory 
of fluctuations, one example being given in the problems. 

The procedure we shall follow is thus to set up a definition for the various 
possible distributions and then find the most probable distribution. The 
behavior of the system under equilibrium conditions will then correspond 
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to this distribution. This procedure is arbitrary to the degree that de- 
fining the distributions is arbitrary. We shall find in the examples dealt 
with, however, that the arbitrary conventions used in defining the dis- 
tribution drop out of the final answers and thus the choice of the convention 
need not concern us. 

We shall consider below a system consisting of a number of independent 
harmonic oscillators, all having the same frequency. For this case we shall 
work out the most probable distribution. The number of ways in which 
this distribution can be achieved will be denoted by JV{&), where S is 
the energy of the system. 

In order to define temperature on a statistical basis and to show the 
relationship of statistical mechanics to thermodynamics, we shall consider 
two systems in equilibrium. If the systems have energies Si and S 2 with 
Si + §2 = Sr where Sr is the total energy, then the number of ways of 
achieving the division into S x and S 2 will be W\ (§1)^2(^2). We shall 
outline' the arguments that show that temperature should be defined by 

d& kTi *• } 

where k is Boltzmann's constant, and entropy is defined as 

§! = it In ^ (Si). (2) 

Using these results, we shall derive the Fermi-Dirac distribution. 

16.1b. The Planck Harmonic Oscillator Distribution. We shall con- 
sider here a system composed of a large number Q of harmonic oscillators, 
each having the same frequency v and energy levels (n + \)hv, where 
n = 0, 1, 2, • • • is the quantum number and h is Planck's constant. We 
shall refer to the quantum states for the individual oscillators as 0-states 
to distinguish them from states of the entire system, referred to as S-states. 
We shall define the distribution in terms of a set of numbers Q$, Q\, Q2, • • • , 
Q n , • • ■ giving the number of oscillators in O-states having quantum num- 
bers 0, 1, etc. Evidently we must have 

Q = El?.. (3) 

n 

If the total energy of the system is S, we must also have 

hv 
^ = H(n + \)hvQ n = Q- + hvT l nQ n . (4) 

n ^ n 

A quantum state for the system, or S-state, is described by telling which 
oscillators are in O-states with quantum numbers 0, 1, 2, etc. A particular 
.S-state may thus be written as follows: 

(0/atcde)(l/fgk) ■ ■ ■ (s/ijk •••)••• (5) 
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which implies that oscillators a, b, c, d, and e are in O-state n = and /, g , 
and h are in O-state n = 1, etc. We shall refer to the expression above as 
a writing. The same Estate may, of course, be specified by other writings 
such as 

(fi/baedc) {\/ghf) ■ ■ ■ (s/kji •••)•••• (6) 

Our problem is to find how many ^-states correspond to a distribution such 
a s Qo = 5, <2i = 3, • • • , being careful not to count different writings of 
the same Estate as different ^-states. We shall first show that the number 
of writings of a given S-state is 

Qo\ Qi\ ■ ■ ■ = nQ n i (7) 

n 

as follows: Consider any given writing; the n = parenthesis can be 
written in Q \ ways (that is, there are 5 choices for the first letter after the 
/ and 4 for the second, etc., in the example above). Similarly each pa- 
renthesis can be written in Q n \ ways so that each Estate can be written in 
IlQ n l ways. Next we ask how many different writings are there for a given 

n 

distribution ? In any given writing, the symbols designating the oscillators 
are arranged in sequence and there are evidently just Q\ ways of permuting 
them in this sequence. Hence, there are Q\ writings in all. Since each 
•S-state occurs written II0„! times in this set of writings, the distribution 

n 

can be achieved in 

w{q^ 0i, ■ • ■ ) = ei/ne.i (8) 

n 

ways; i.e. there are Q\/UQ n \ different quantum states corresponding to 
the distribution. 

Which of the possible distributions is the most probable, that is, has the 
largest W, subject to a fixed total number of oscillators Q and total energy 
& ? We shall find this distribution by requiring that, for any small change 
in the Q n 's, the value of W remains constant. This does not bv itself 
prove that W is maximum ; W might be a minimum equally well. How- 
ever, the distribution so obtained can be shown by a separate calculation 
to maximize W. (This is left as an exercise for the reader.) Let us 
consider a perturbation, referred to as (Per) of the distribution specified 
as follows: 



(PerN 



This corresponds to changing 8Q r oscillators from O-state r to O-state s. 
We shall deal, for convenience, not with W but with In W and shall denote 



Qr -> Qr + BQ r 


(9a) 


Q*->Q* + SQs, 5Qe = SQr 


(9b) 


& -» 5 + 8&, 5S = hv{r - s)8Q r . 


(9c) 
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the change produced in In W by 5 In W. This perturbation does not con- 
serve the total energy S so that we must supplement it with another which 
restores the energy. Suppose that the second perturbation, (Per) , is 
designated by a prime and transfers 8Q V oscillators from O-state u to O-state 
v. Then, as we shall show below, we must have 

SlnfF (SlnW)' 

in other words, the change in the logarithm of W per unit energy change 
for the two perturbations must be equal and, since u and v for (Per) can 
be selected arbitrarily, the same value for (8 In fF)'/8&' must be obtained 
for all perturbations. From this result we shall readily derive the Planck 
distribution and later the Fermi-Dirac distribution. 

The proof of equation (10) is very simple. If W is to be a maximum, 
so is In W. Now consider changes represented by (Per) and (Per) : 

(Per): SQ r = -8Q S = 8&/hv(r - s) (11a) 

(Per)': 8Q U = -8Q V = 8&'/hi>(u - v). (lib) 

These will produce changes in In W of 8 In W and (5 In W) . If 8& = 
— 8$', these changes will not change & and, if W is a maximum, they will 
not change In W. The change in In W will be proportional to 8tb and 5d> , 
hence 

8\nfF= 8lnW+ (SlnJV)' 

-~ir s& + ~w~ 88 - (12) 

For this to vanish when S& = —8tb , the two coefficients must be equal, 
so that we may introduce a symbol /3 defined as follows: 

5ln/F (5ln#0' 

We shall later show that /3 = 1/kT where k is Boltzmann's constant and 
T is the absolute temperature. 

The expression for 5 In JV/8& in terms of the parameters Q n of the 
distribution may be readily obtained from the formula for In W and the 
derivative of a factorial. The latter may be taken as the value for the 
minimum change 8x = 1 in 8 In xl/8x; this leads to 

d\nx\ . In (x + 8x)\ - ln#! . 

= = In (x + 1) = In x, (Ha) 

dx 8x 

5 In x\ = 8x In x. (14b) 
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From this we readily obtain [using also (11a) and 8] 

5 In W = -5 In Q r \ Q a \ = -SQ r In Q r - 8Q a In Q. 
_ 85 In (Q./Q r ) 

hv{r - s) { } 

If we now imagine that u and v are held fixed and r and s are allowed to 
vary over all possible quantum numbers, we must have from (13) 

llnfT In (Q s /Q r ) 
^ = _ Sg" = hv{r-s)' (16) 

From this we find that 

phvr + In Q r = 0A« + In ft. (17) 

If we let s = 0, this shows that 

ft- = Qo^ hrr = Soe- 1 "- (18) 

y = jSAv. (19) 

The quantities /3 and £> are determined by the conditions on Q and & 
by the following steps: 

= EG. = Go(l + e-^+e- s * + ...)=— ^- (20) 

i — e ' 

g = (A*0/2) + A. £»ft, 
= (A*0/2) + fe6o(0 • 1 + le~? + 2e- 2 * + 3 e -^ + • • •) 

= (hvQ/2) + hvQo (- ^) (1 + e-? + e~ 2 ? + • • •) 



= (hvQ/2) - hvQ d 



dy 1 



B-T 



= (hvQ/2) + hvQ (1 _^ J_ 7)2 = A*0 [| + ^i-J • (21) 

These equations determine Q and S in terms of Q Q and r and can be solved, 
in principle, for y and Qo in terms of Q and &. We shall shortly show how 
y = /3A»< can be identified with hv/kT; this leads to the following expres- 
sion for the average energy of an oscillator: 

<g) A v. = ^[^ + efeW J_ 1 ]> (22) 

and to an average value of n of 

<">Av. =3^; r- (23) 
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The probability that any oscillator is in the state n is Q n /Q and is given by 
° = e -»W*r (1 _ e -hvlkT^ (24) 



Q 

The foregoing expressions give the Planck distribution for a system of 
harmonic oscillators. 

By considering second-order changes produced by the perturbations 
&Qr, SQ S , etc., it is possible to show that In W is actually a maximum and 
that it decreases very rapidly as the distribution deviates from the maxi- 
mum probability distribution. The value of In W for the most probable 
distribution is in general very large. This leads to a surprising and im- 
portant result: The value of In W and the value of In (the total number of 
quantum states with energy S) differ by a negligible amount. This is 
surprising because the number of quantum states in the most probable 
distribution is, of course, only a small fraction of all the quantum states of 
energy £>. A numerical example will help to show how the result comes 
about The value of In W for one gram atom of oscillators with 7=1 
will be about 10 23 . [This result may be derived from the expression for 
In W by using Stirling's approximation x\ = (x/e) x \r2irx for x, with 
proper precautions, and more readily from the formulae in the problems 
for this chapter.] Suppose that only one state in 10 20 is included in the 
most probable distribution. Then the total number of states will be 
10 20 exp (10 23 ) and the logarithm of this number will be 10 23 + 46. 
Evidently the term 46 is wholly negligible in In W. The largeness of the 
terms in In W frequently enables calculations to be made very inexactly 
so that W 'may be incorrectly estimated by large factors without significant 
errors being introduced into In W. 

16.1c. Relationship to Thermodynamics. We have shown that all per- 
turbations of the system of harmonic oscillators must lead to the same 
value, which we call |8, for 

^, 

From this we shall show that if W(&) represents the number of ways in 
which the most probable distribution can be achieved, then 



d& 



= p. (26) 



This is proved as follows: Consider the most probable distributions for 
energy S and for energy & + AS. We can evidently transform from 
one distribution to the other in series of steps of the form (Q r , Q B ) 
—> (Qr + SQ r , Qs + SQ S ) considered before. These steps will introduce 
energy changes &&, 8S>', 5&", etc., and for them the changes in In W 
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will be /35S, /35S', 05S", etc. The total change in In W will thus be 
/3(5S + 8&' + 8§>" + ••■)= 0AS where AS is the total change in energy. 
Hence A In PF/A& = /3 for the change from the maximum distribution 
at energy S to that at S + AS ; since the maximum probability distribu- 
tion is a function of S, this ratio is the desired derivative. 

Now suppose we have two systems, 1 and 2, which can share energy. 
We can then consider them to be a combined system with a total energy 
S = Si + S 2 . The possibility of sharing energy corresponds to putting 
two bodies in thermal contact so that they can exchange heat and come to 
thermal equilibrium with equal temperatures and maximum total entropy. 

As a result of their interaction neither system 1 nor system 2 has the 
same set of energy levels that it had when isolated. In fact the only 
energy levels are those of the system as a whole. However, if the inter- 
action is very small each energy level for the combined system differs by 
a negligible energy from the sum of the energy levels of the two isolated 
systems. Thus it appears reasonable to treat the combined system as two 
separate systems while at the same time imagining that systems 1 and 2 
may interact so as to transfer heat. This argument is far from rigorous, 
however, and it is difficult to prove rigorously that the behavior of two 
weakly interacting systems does properly simulate the thermodynamic 
behavior of two bodies in contact. For detailed examinations of these 
problems, the reader is referred to the literature. 1 

For any given division Si and S 2 of the energy, system 1 can be set up 
in /^i(Si) ways and system 2 in /F 2 (S 2 ) ways, for the distributions of 
maximum probability. In fact, as discussed at the conclusion of the 
derivation of the Planck distribution, so far as In ^Fi(Si) is concerned, 
we can equally well include all quantum states of energy Si. For similar 
reasons we can consider, if it is desirable to do so, the total number of 
states lying in a small energy interval Si to Si + SSi of system 1. For 
all of these choices, the large number aspect of W will lead to the same 
value In W. The number of ways of dividing S into Si and S2 may thus 
be described, so far as its logarithm is concerned, by In /Fi(Si)/F2(S 2 ); 
the most probable division is that for which this quantity is a maximum. 
(Because of the independence of In W of the exact definition of W, we 
need not be concerned even with having Si + S 2 = S; it is sufficient to 
have Si + f ^ fall within a range 5S about S.) The maximum of 
In /Fi(Si)/F 2 (S 2 ) corresponds to 

d . n/(F F , . ^ln/Fi(Si) iln/F 2 (S 2 ) 

— In JV{& U S 2 ) = = — — » (27) 

rf&i «G>1 ae> 2 

the last equation following from S 2 = S — Si. Hence under equilibrium 

1 R. H. Fowler, Statistical Mechanics, Cambridge, 1936. R. C. Tolman, The Principles of 
Statistical Mechanics, Oxford, 1938. 
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conditions 

0! = —z — and 2 = — ; jp — ( 28 ) 

must be equal while In W\ + In W 2 is a maximum. It can be shown that 
these conditions are equivalent to setting the entropy § equal to k In W and 
= 1/kT. (See Problems.) The equality of 0i and 2 thus corresponds 
to uniform temperature and the maximizing of W to the maximum entropy 
condition. 

We shall indicate the method of establishing the thermodynamic rela- 
tionships, in terms of a simple example. Suppose that system 2 consists 
of harmonic oscillators of arbitrarily low frequency. Then y = fihv is -Cl 
and 

From the classical theory of statistical mechanics, we know that a system 
of oscillators has average energy of kTQ. Hence we identify kT with 
l//3 2 for system 2 and since 0i = 2 we identify with 1/kT generally. 
This identification is not wholly satisfying, however, because the theorem 
& = kTQ itself involves a considerable amount of classical statistical 
theory. A better procedure would be to let system 2 represent an ideal 
monatomic gas. We should then find that 

pV = JV/02 (30) 

where p is the average pressure, V the volume, and TV the number of 
atoms. Since the thermodynamic temperature scale can be defined for 
a perfect gas by the relationship 

pV = NkT, (31) 

the identification of 2 with 1/kT would be satisfactory by this means. 
To carry out this procedure for a quantum gas is relatively straightfor- 
ward and would introduce few features not already discussed in Chapter 5 
and in the next section of this chapter; the purpose of this section is to 
introduce ideas and formulae rather than to establish them rigorously and 
we shall not give the derivation. A still more general procedure is to intro- 
duce the dependence of the energy of the quantum states upon the volume 
V or some other mechanical parameter. In terms of this, the equation of 
state in terms of & and V can be given and a general thermodynamic 
equivalence established. This procedure is presented in the problems. 

16.1d. Derivation of the Fermi-Dirac Distribution. We shall now ob- 
tain the formula for the Fermi-Dirac distribution using essentially the 
same method as that employed in obtaining the Planck distribution. 
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For this purpose we consider a system containing N electrons. The 
quantum states of the system are divided into groups, all the states of any 
group having approximately the same energy. Consider two such groups 
with energies & r and & 3 ; suppose that there are S, states with energy & r 
and that N r of these are occupied by electrons and P r (= S r — N r ) by 
holes. Our first problem is to find in how many ways the N r electrons 
can be distributed among the S r states. According to the Pauli principle 
(see Section 15.7), electrons are to be regarded as indistinguishable so that 
the quantum state of the system is specified by giving a list of the occupied 
one-electron states. Our problem is thus reduced to finding in how many 
ways the N r occupied states can be selected from S r states. To do this 
we list all of the S r states giving each a symbol a, b, c, etc. Then one 
arrangement of electrons may be written as 

{abejldcgh ■■■) (32) 

where the states to the left of the / are occupied and those to the right are 
unoccupied. It is at once evident that each distinct selection of occupied 
states may be written in N T \ P r \ ways; and also that there are S r \ ways 
of permuting the S r symbols in the entire expression while keeping N r 
symbols to the left and P r to the right of the /. Hence there are 

S r ]/N r \P r \ (33) 

distinct ways of choosing the N r occupied states. The total number of 
ways of setting up a distribution N u N 2 , • • • , etc., is, therefore, 

W= U(S r \/N r \ P r \). (34) 

r 

We now consider a perturbation of the distribution which involves 
shifting 8N r electrons from group S s to group S r : 

N r ^N r + 5N r (35 a) 

iV s -> N s + 8N S 8N S = -SN r (35b) 

SS = (fi, - & s )SN r . (35c) 

All of the previous arguments relating to the equality of (3 for such changes 
hold for this case so that we may at once write that 

1 8 In W 

Ti = *—vr' (36) 

The value o{8\nW is readily found in terms of 

5lniV r ! = 8N r \nN r (37) 

8 In P r \ = 8P r In P r = -8N r In P T (38) 
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and similar relationships. We thus obtain 

= IhK = * [-In JV r + In P r + In N 8 - In P 8 ]. (39) 

5(9 ts> r — ©8 

From this equation we obtain 

/3S r + In (N r /P r ) = 0S S + In N s /P a . (40) 

Since this equation must hold for all choices of s and r independently, both 
sides must be independent of r and s and can be written as /3&V. Introduc- 
ing the Fermi-Dirac distribution function f (d> r ) = f r by the equations 

N r = i r Sr and i» r = (1 - i r )S r , (41) 

we obtain 

Inj^-r-tKSF-Sr) (42) 

1 l r 

which leads to 

f ' = 7-I ST? TV (43) 

1 + exp /3(© r - &,) 

Since j8 = 1/*T this is the desired expression for the Fermi-Dirac dis- 
tribution. It is to be noted that the arbitrary division into groups con- 
taining S r states disappears from the final answer and that f is a function 
of the energy of the state, the Fermi level G> F , and kT. The determination 
of &f for semiconductors is discussed in Section 16.3. 

The interpretation of the Fermi level as a chemical potential is readily 
obtained from equation (40). For this purpose we suppose that SN 
electrons are added to the system. If these electrons are added to states 
in group r, the increase in the. free energy, 7 = S — T§ = & — kT In W is 

S? = 5S - kT8 In W = &JN + kTSNln (N r /P r ) 

= & F 5N (44) 

by virtue of (34), (40) and the definition of 6> . Thus the change 57is 
independent of the group of states into which the electrons are put. &f 
is similar to a chemical potential, which may be defined as the change in 
free energy of a system per added atom of a given chemical element. 

This result may be used to prove the conclusion stated in Section 12.4 
that the difference in Fermi levels <p is the voltage read on a voltmeter. 
For this purpose we suppose that 8N electrons are transferred from con- 
ductor A at Fermi level &fa to conductor B at Fermi level 5>fb, the trans- 
fer taking place reversibly through a motor so that 

the energy liberated = -ehN(V A - V B ) (45) 

where V A and V B are the voltages. Our problem is to show that 
— e (Vx — V B ) = &fa — &fb from which it follows that Va — V B = 
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<Pa ~ <pb according to the definition of the Fermi level <p of Section 12.4. 
We shall prove this relationship by applying the thermodynamical 
theorem that 5§ = for any reversible change in which no heat flows. 
This result is obtained in Problem 12, for the case in which work is done by 
pressure, in the form that d§ = kpdQ where d2. = d (energy of system) + 
d (work done by the system). We shall suppose that the voltage differ- 
ence is maintained by a battery connecting conductors A and B. If the 
system is thermally insulated, so that no heat can flow from outside, then 
the reversible shift of the electrons will leave dS = 0. Hence the work 
—ef>N(V a — V b ) done by the system must equal — 8$ and we may write 

-eSN(F A - V B ) = -5S = -(aS - T5§) = -57. (46) 

However, 57 can be written as the sum of 57 a — — &fa8N due to remov- 
ing 8N electrons from A plus 55 ' B = &fbSN due to adding them at B. 
Inserting 55 = 5? A + 83 B in (46) leads to 

-e{V A - V B ) = & FA - & FB , (47) 

the desired equation. 

16.2 CALCULATION OF THE EFFECTIVE NUMBER OF 
QUANTUM STATES IN AN ENERGY BAND 

In order to calculate the density of electrons in the conduction band, we 
may evaluate the integral of equation (9) of Section 10.3 using for 7V(S) the 
appropriate value given in equation (9), Section 9.1. This leads to 

n = exp {& P /kT) f * [ir(2m) H /h 3 KS - S C ) H exp (S/kT)JS 

= exp [(&? - & c )/kT)[4Tr(2m) n /h 3 ] f % °exp (-^/kT^&^d&x (1) 

where the substitution &i = S — S c has been used. Since the integral con- 
verges rapidly, the upper limit may be extended to infinity. The integral 
then becomes a standard form given in Pierce, "A Short Table of Integrals", 
Third Revised Edition, formula 496, the value being (TkT) H kT/2. Insert- 
ing this in (1) gives 

n = 2{2irmkT/h 2 )^ exp [- (S e - & F )/kT] (2) 

which is formula (11) of Section 10.3 quoted in the text. The effective 
number of states in the conduction band is thus 

N c = 2{2irmkT/h 2 )^ 
= 4.82 • 10 15 T ?4 cm" 3 
= 2.41 • 10 19 cm" 3 at 20°C (3) 

N 2 = 2.32 X 10 31 r 3 cm- 6 . (4) 
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[This same result can be obtained by integrating overp-space directly with- 
out making use of equation (9) of Section 9.1. For this method instead of 
N(&)d&, we use (2/h 3 )dP x dP y dP z with 5 = (P* 2 + P y 2 + P z 2 )/2m. The 
integral in (1) then becomes the product of three integrals each of which 
contributes a factor (lirmkT / h 2 )* to iV c .] 

16.3 TEMPERATURE DEPENDENCE OF THE FERMI LEVEL 

In this section we shall show in more analytical terms than employed in 
the text the way in which &f and the electron and hole densities depend 
upon the temperature. We shall deal with the same distribution of 
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Fig. 16-1 — Distribution of Quantum States in Energy. 



quantum states that was discussed in Chapter 9. This is reproduced in 
Figure 16.1, with the effective number of quantum states in the filled and 
empty bands indicated. As explained in Chapter 10, the requirement for 
electrical neutrality is that the sum of the positive charges: 

p = density of holes in the filled band 
p a = density of holes bound to acceptors 
Nd = density of donors 
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must equal the sum of the negative charges : 

n = density of electrons in conduction band 
n d = density of electrons bound to donors 
N a = density of acceptors. 

This leads to the equation 

P + Pa + N d = n + n d + N a - (1) 

In Chapter 10, formulae for these densities are given in equations (8), 
(11), (13), and (14) of Section 10.3. These depend upon the nature of the 
semiconductor through the energy values of & v , & a , & d and § c , the effective 
masses in N v and N c , and the impurity concentrations N a and N d - Given 
these quantities, our problem is to solve for the n's and p's as a function of 
T by solving equation ( 1 ) for & F . This is done, as shown below, by plotting 
the left and right sides of equation (1) each as a function of &f and finding 
the solution graphically. The example chosen corresponds approximately 
to «-type germanium, which at room temperature would have a resistivity 
of 2.4 ohm-cm — the same as for the example discussed in Section 10.4. 

The values used for this example are as follows: 

& v = —0.36 electron volt, & a = —0.32 electron volt, 

&d = +0.32 electron volt, S c = 0.36 electron volt. (2) 

(This corresponds to arbitrarily setting the zero of energy at the midpoint 
between the bands.) The values of N a and N d are 

N d = 10 15 per cm 3 , N a = 10 14 per cm 3 (3) 

and the electron and hole masses are taken to be the same as for a free elec- 
tron so that equation (3) of Section 16.2 applies. 

The densities obtained upon the basis of these assumptions are plotted 
in Figure 16.2 on a logarithmic scale as a function of 6> F for T = 300°K. 
Positive charges are drawn as solid lines and their sum, which is the left 
side of equation (1), is shown as a heavy line. The densities of negative 
charges are similarly indicated by dashed lines. Where the heavy lines 
cross, equation (1) is satisfied and the condition for electrical neutrality 
is realized. 

The behaviors of the individual curves for the negative charges can be 
understood as follows: 

N a - This is simply the fixed negative charge due to each acceptor. 

(This charge is not neutralized by its share of the valence bonds.) 

n. This is the charge of the electrons in the conduction band. According 

to Section 16.2 it is about 2.41 X 10 19 exp [(-S„ + & F )/kT]. 

Expressed in electron volts this is 2.41 X 10 19 exp ( — 39AF) where 
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16-2 — Solution for gy with Density of Acceptors, N a , = 10 14 i 
Density of Donors, N d , = 10 ls cm- 3 and T^ 300°K. 



AV is the voltage of &f below & c or 

n = 2.41 X 10 19 ■ l(r lw . 



(4) 



n& 



This gives the straight line shown in Figure 16.2. (When &p 
approaches within kT of S> c , the approximation is not valid and n 
should deviate from the straight line as shown in Figure 10.5.) 
This is the number of electrons in the donor states. For 6>f much 
larger than &a these states are all occupied and n d = Nd- For 
&f — &d, they are half occupied and the curve lies below the point 
(Nd, &d)y shown as a cross, by a factor of 2. For lower values of &f, 
the approximation f = exp (&f — §> d )/kT may be used which gives 
a straight line for n d , with the same slope as for n. If extended, the 
straight line would pass through (Nd, &d)- 

Precisely similar conditions apply to the positive charges Nd, p, and p a , 
with the distribution for holes used in place of that for electrons. 

For the conditions of Figure 16.2, n is much larger than n d for all values 
of &f- This situation arises from the fact that kT is comparable to the 
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binding energy of an electron and N c is much larger than Nd- As a result, 
the greater number of states available to electrons in the conduction band far 
more than overweighs the difference in the statistical factor/ which favors 
the lower energy states. Under equilibrium conditions (where the heavy 
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S F IN ELECTRON VOLTS 

Fig. 16-3 — Solution for & F with Density Acceptors, N a , = 10 14 cm -2 , 
Density of Donors, N d> = 10 15 cm" 3 and T ^ 600°K. 

curves cross), a negligible number of holes are present in p and p a - Thus 
for this case 

n + N a = N d or n = N d - N a . (5) 

This means physically that the excess electrons introduced by the donors 
minus the acceptors have all been freed by thermal agitation and are in the 
conduction band. However, since p is negligible, the temperature is not 
high enough to break any appreciable number of bonds. 

For T = 600°K, the situation is as shown in Figure 16.3. Under these 
conditions the dominant contributions come from n and p and the values 
of JV and Nd do not influence the situation appreciably. This corresponds 
to a temperature for which the intrinsic behavior is reached. It is evident 
that neutrality requires that & F = (& v + S c )/2 so that p = n. 
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For Figure 16.4, T = 150°K, and n is more than 10 times larger than n d 
for all values of §>f and contributes most of the variable negative charge. 
Charge balance occurs when 

n + n d = N d - N a . (6) 



I i i i i I i i i i I 



i i I i i i i I i i i i I i i i i I i i 

(150°K) 



p+Nd + p a = n+N a + n d /_ 
/ 



n + N a + n d 

( NEGATIVE 
CHARGES) 




-0.1 0.1 

5 F IN ELECTRON VOLTS 



Fig. 



16^ — Solution for g/c with Density of Acceptors, N a , = 10 14 cm 2 , 
Density of Donors, N a , = 10 16 cm" 3 and T^ 1S0°K. 



That is, the electrons introduced by the excess of donors over acceptors 
neutralize the donors and acceptors with most of these electrons in the 
conduction band, only about 1 in 3000 being in donor states. 

For T = 50°K (Figure 16.5) the value of n d is larger than n for &p < & d . 
For &p > & d the donors are substantially filled; that is, most of the elec- 
trons have not been excited from the donor levels. The values of p andp a 
are negligible and are not shown in the figure, which shows the energies 
near G> d on an expanded scale. Again equilibrium corresponds to 



» + N a + n d = N d . 



(7) 



However, in this case where the donor levels are almost completely filled, 
it is more convenient to consider the holes in the donor levels rather than 
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Fig. 16-5 — Solution for gy with Density of Acceptors, N a , = 10 14 cm -2 , 
Density of Donors, N a , = 10 16 cm- 3 and T ^ 50 C K. 

the electrons. The number of holes is Nd — n d and leads to rewriting 
equation (7) as 

n + N a = (N d - n d ). (8) 

Here n + N a is the density of negative charges and (Na — n d ) is the net 
density of positive charges, that is, the donor levels occupied by holes. 
The variation of (Nd — n d ) with &f is similar to that of p a , and (N d — na) 
has its half value of N d /1 when & F = & d . Figure 16.5(b) shows (N d — »d) 
plotted versus &p as a solid line and n + N a as a dashed line. 

This method is still more appropriate at the lower temperatures of 37^°K 
and 20°K [(Figure 16.6(a) and (b)]. Here we have considered an addi- 
tional value of 10 12 for N a . For these temperatures, 6>f approaches &a so 
as to leave the donor levels partially filled as T = is approached. How- 
ever, if N a were equal to zero, it is evident that &p would lie halfway 
between S c and &d as T approaches zero, just as 6p lies halfway between 
the filled and conduction bands in the intrinsic range. 

In the next section we shall analyze further the behavior of Sp and will 
discuss the relationship of the slope of In n vs \/T plots to energies. 
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Fig. 16-6 — Solutions for g P with Densities of Acceptors, N a , = 10 14 cm -3 and 10 12 
cm- 3 , Density of Donors, N d , = 10 15 cm- 3 , for T ^ 37.5°K and 20°K. 

16.4 ACTIVATION ENERGIES 

The results obtained just above can be put in analytical form with 
instructive results. As T approaches zero, fid approaches Nd — N a and 
n approaches zero. We wish to analyze the behavior of In n as a function 
of T and to interpret this in terms of an activation energy. For low T's 
such that n is negligible compared to »<j, the neutrality condition is 

n d = N d - N a (1) 

and since 

n d = m (2) 

we find 

f=(N d - N a )/N d . (3) 

Since by assumption Nd > N a> this gives f a value between zero and unity 
which can be solved for &p as follows: from 



f = 



1 + exp [(S - &F)/kT\ 



(4) 
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we obtain 

6 - S F = kTln (1 -f)/f, (5) 

and substituting & d for § and (7V d - iV a )/iV d for f we obtain 

Sf = & d + kT In [(N d - N a )/N a }. (6) 

This shows that for temperatures so low that n is negligibly small compared 
to n d (so that n d may be taken to be N d - N a ), & F is linear in T. For 
N d = 2N a , the N d states are just half filled at T = and & F does not 
depend on T since In 1 is zero. For N d > 2N a , the slope is positive and & F 
increases with increasing T; the physical interpretation is that for this 
case the donor states are more than half full and & F must, therefore, lie 
above g d . For N d < 2N a , the states are less than half full and & F always 
lies below 6> d . 

When & F is varying, what activation energy will be obtained from a 
In n vs 1/7 plot? The interesting result is that S c - & d will be obtained 
as may be seen as follows: We have 

n = N c exp [(S* - & c )/kT] 

= 2(2irmkT/h 2 )* exp [& d - & c + kTln (N d - N a )/N a ]/kT 

= 2(2irmk/h 2 )*T*l(N d - N a )/N a ] exp (& d - & e )/kT (7) 

from equation (2) of Section 16.2. From this equation it is seen that the 
exponential dependence only involves & d - & c and that the slope of & F 
versus T affects^ the coefficient. Thus at sufficiently low temperatures, a 
plot of In (nT~ 3A ) versus 1/T should give the correct value for S c - & d . 
This result can be generalized in terms of a plot of & F vs T, Figure 16.7. 
This plot summarizes the results obtained in Section 16.3 and just above in 
this section. When N a = 5 X 10 14 , or half of N d , & F starts from T = 
with zero slope. For N a greater than this, it starts downward. For any 
particular curve of & F versus T, the plot of In nT~ H will vary as 
(& F - & c )/kT. It will, therefore, have as slope on the 1/T plot of 

d ( &r ~ &c \ = & F - & c J_ d& F _ & F - & c J_ d& F 
d(l/T)\ kT J k + kT'd(\/T)~ k kT'lfJ¥ 2 



where 



-i[«'-r£-«.]-jlV-« 



(8) 



s,*-s,-r^. (9 ) 



This is just the slope associated with extrapolating a point on the & F vs T 
curve tangentially back to T = as is shown in Figure 16.8. The value 
& F * so obtained is the apparent Fermi level so far as activation energy 
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Fig. 16-7— The Fermi Level for Electrical Neutrality Plotted against Temperature for 
Several Densities of Acceptors. 

slopes are concerned. For the example chosen, it comes above & c so that 
the apparent activation energy is negative. This anomalous result is just 
what would be expected in the saturation range. In this range of tempera- 
ture, the value of n is independent of T while N c increases. Thus if N c 
were treated as a lumped set of states all at energy S e , then increases in T 
correspond to decreases in f for these levels just as if the Fermi level were 
above S c . This shows why the apparent activation energy turns out 
negative as shown in Figure 16.8. _ 

From these considerations it is evident that in general no simple interpre- 
tation of the activation energies is possible from the l/T plots. ' Instead 
it is necessary to make detailed comparisons between theoretical curves 
based on exact solutions of the models discussed in Figures 16.1 to 16.7. 
Such comparisons are presented for a number of cases for silicon alloys in 
the paper of Pearson and Bardeen. 

1 G. L. Pearson and J. Bardeen, Phys. Rev. 75, 865-883 (1949). 
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Fig. 16-8 — Apparent Activation Energy for N a = 10 u . 
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Fig. 16-9 — Plot of Experimental Data for Several Silicon Samples for Comparison 
with Equations (10) and (11) of Section 16-4. 
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The simplest of these comparisons is made in the range in which both 
n andp are appreciable. Using the approximations of equations (11) and 
(14) of Section 10.3 for these gives 

npT~ 3 = 4(2irmk/h 2 ) 3 exp (g, - & c )/kT 

= 2.32 X 10 31 exp (S, - g c )/*:Tcm- 6 °K~ 3 . (10) 

The data obtained on the basis of this formula falls on a straight line as 
shown in Figure 16.9. However, the value of the coefficient obtained from 
the experimental data is found to be about 34 times larger than predicted 
from formula (10). This result can be explained by supposing that g c — 
§> v , the energy gap between the bands, is temperature dependent and de- 
creases, probably because of thermal expansion, as $>q — /3T. Using this 
value in equation (10) modifies the exponential as follows 

exp (S, - S e )/kT = ef" h exp (S G /kT). (11) 

The value of |8 has been estimated independently from measurements of 
photovoltaic effects in silicon n — p junctions by F. S. Goucher and 
H. B. Briggs and appears to account for most of the factor of 34. This 
conclusion is in agreement with the other methods of estimating j8 discussed 
in Section 12.8. The remainder may be due to the electrons and holes 
having effective masses different from the free electron mass. 

Problems 
(See also Chapter 10) 

1. Derive the Fermi-Dirac distribution law by considering two pertur- 
bations 5g and 5g as was done for the Planck distribution. 

2. Consider the distribution for electrons trapped on donor levels taking 
into account the effect of spin which allows two possibilities for each trapped 
electron. (Electrostatic effects will prevent more than one electron from 
being trapped on one donor.) Show that there will be 

2 n *N d \/n d \(N d - n d )\ 

ways of putting n d electrons on N d levels and hence that 

n d = N d /[\ + (|) exp (S„ - & F )/kT\. 

This result is used in Problem 1 of Chapter 10. 

3. Show that In W for the Planck distribution is really a maximum by 
considering the value of S In W for a large perturbation with 5S' = — 5g. 

Remark 

The following problems furnish an outline of one of the methods of 
showing how thermodynamics follows from the statistical mechanics of 
quantized systems. 1 They also serve to introduce the partition functions 

1 This treatment is similar to that of E. Schroedinger, Statistical Thermodynamics, Cam- 
bridge, 1946, but differs by laying emphasis on a more general heat bath. 
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Z and z which are very useful for calculating average properties and their 
fluctuations. The basic assumption is the assumption of equal a priori 
probabilities for all quantum states of the system. Problem 4 introduces 
the canonical ensemble and Z as artificial aids and shows that averages 
for the canonical ensemble are the same as averages for one particular 
microcanonical ensemble contained in the canonical ensemble. Thus the 
important terms in Z all arise from one particular microcanonical ensemble. 
In Problem 5, the number of states in the microcanonical ensemble is 
evaluated in terms of Z. In Problem 6, it is shown that the important 
terms in the canonical ensemble arise chiefly from one particular dis- 
tribution and that the larger the system becomes, the less important are 
other distributions. This distribution must, of course, be the most 
probable distribution. These problems thus show that the canonical 
ensemble may be used in place of the microcanonical ensemble for calculat- 
ing the average properties of a system, the great advantage of this result 
being the possibility of using the partition function Z. The canonical 
ensemble is next shown in Problems 9 and 10 to have a physical meaning; 
in fact it represents the distribution, not of an isolated system, but of a 
system in contact with a heat reservoir. In Problems 11 to 14, the average 
pressure of system in equilibrium with a heat reservoir is expressed in terms 
of Z, and the identification of statistical and thermodynamical properties 
is given. 

4. Show that appreciable deviations from the equilibrium distribution 
are negligible by considering the average over the canonical ensemble. The 
canonical ensemble is introduced at this point as a mathematical aid; 
it is defined as follows: Give each quantum state for the system with 
energy S a a weight exp (— @& a ). Let 

Z = £ e-" 5 ". 

all a 

Z is called the states sum (Zustandssumme) or partition function. The 
weighted average energy over the canonical ensemble is thus 

<S> = £ S a e-' 5 7Z = -^InZ, 

where ( ) is introduced to signify the weighted average for the canonical 
ensemble. The average squared energy is 

Suppose that the system can be considered to be made up of a large number 
of similar and substantially independent subsystems, 1, 2, • • • , Q. Then 
if the energy levels of one of those subsystems are s a , S/3, s T , etc., then each 
energy of the whole system may be written as a sum of these energies. 
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Show that 

Z = z« 2 

where z equals the states sum for one of the subsystems. Show that 

<S 2 > - <S> 2 = Q[( S % - ( S ),*] 

where the ( ) t expressions represent averages over the terms in z. From 
this conclude that for a large system, for which Q > 10 10 , for example, 
that a negligible contribution to Z is made by states which have \$ — (&)\ 2 
> 10 _6 <S) 2 , provided that (e 2 ) z is not much greater than (s) z 2 . How does 
this prove that averages over the canonical ensemble of a property of the 
system equals the average over a microcanonical ensemble with § = {&) ? 
Show also that the formula 

(which is used in the preceding proof) leads to the harmonic oscillator 
energy, equation (22) of Section 16.1, when z is calculated for the energy 
levels (» + \)hv. 

5. From the result of Problem 4 show that 

In ^»«S» = /3(§) + In Z, 

where W m {{&)) is the number of states in a microcanonical ensemble of 
energy in a small range near (S). The quantities Z and {&) are functions 
of )3, the as yet unidentified parameter in Z. Do this by writing Z in the 
form 

where the states are lumped into groups of W m {H>) states of approximately 
the same energy S. Show that, for any reasonable choice of the energy 
intervals used for the groups, the value of In W m {&) is independent of the 
choice, provided that Q is large. [For example, show that if the energy 
interval is chosen as proportional to Q 1 , Q°, or Q~\ the value of In W m (JS) 
is not affected.] 

6. Show that the major contribution in Z comes from states in sub- 
stantially the same distribution. This is done by showing that there is an 
average distribution represented in Z and that deviations from it are 
negligible. Accordingly we find the average number (Q a ) of subsystems 
with energy s a and also the average squared number (Q a 2 ). The mean 
squared deviation of Q a is then shown to be proportional to Q so that the 
fractional deviation is proportional to \/Q and is negligible for large Q. 
The analysis is carried out as follows: Each term in Z is of the form 
exp —^[QcfioL + Qpep + Q y e y + • • • ] where the Q a 's represent the number 
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of subsystems in each state. Evidently 



and 



Inserting z Q for Z, obtain 

(Q«) = Q 



e"* f « 



z 



(Qa*) = (Qa) 2 + Q L 



1 -- 



1 



Show that for large values of Q, the fractional mean square deviation of Q a 
is negligible so that the important contributions to Z all come from sub- 
stantially the same distribution. 

7. From Problems 5 and 6 prove that for large systems (and finite values 
of j8) that 

In /F«S» = P(&) + In Z 

where W applies to the most probable distribution. 

8. Show that Problem 6 reduces directly to the Planck distribution 
[equation (24) of Section 16.1] if each oscillator is taken as a subsystem. 
Subsystems in general need not be so simple. (The same procedure can- 
not, of course, be applied to the Fermi-Dirac distribution since the Pauli 
principle precludes treating individual electrons as independent sub- 
systems.) 

9. Show that a heat reservoir, which may be taken to be a system in 
which Q is made arbitrarily large, satisfies the condition that 

5 In fV(&) = $65, 

for large changes 5§. Hint: express Problem 7 in terms of Q and z(J3) 
and let Q — > <*> . 

10. Show that a system in equilibrium with a heat reservoir with param- 
eter fe will have a relative probability exp (— $R& a ) of being in a state 
of energy 5> a . Do this by using the assumption of equal a priori prob- 
abilities, the method of combined systems used in Section 16.1 and the re- 
sult of Problem 9. This shows that a system in equilibrium with a heat 
reservoir is distributed in a canonical ensemble with properties derived 
in Problems 4 to 6. 

11. Suppose that the energy of each state & a of the system is a function 
of a mechanical variable, such as the volume V. Then the pressure exerted 
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by the system in state 6> a will be 

Pa ' ~ IV ' 

The state sum will then be a function Z(V,8). Show that the average 
pressure is 

(p) = --^-AnZ. 

w bqv 

12. Next consider the state of the system as specified by V and ft Let 
the state of the system change by 88, 8V. Show that 8 In W, as defined in 
Problems 5 or 7, can be written in the form 



5 In W = 8 [- ^£^88 - ^-^8F+ - n °-^8V 



d 2 In Z „ d 2 In Z „ IdlnZ 

5— 88 SV A 

dB 2 M dVdB B dV 

= 8[8(&) + (p)8F] = BS2L, 

where 52 must be the heat furnished by reservoirs in order to satisfy 
conservation of energy. This shows that 853. is an exact differential, 
since it is equal to S In W and \nW "\s a function of 8 and V. From this 
show that, in a Carnot cycle operating between reservoirs 8\ and 182, the 
thermodynamic efficiency is 

d% - d% i ft 



dSL x ft 



2 



This shows that ft : ft = T 2 : Ti, according to the definition of the Kelvin 
or absolute temperature scale. We may, therefore, define a constant k 
as the ratio 7\/ft. (The proof that k is Boltzmann's constant requires 
the treatment of a particular model such as an ideal gas or a harmonic 
oscillator, like that discussed in Section 16.1.) 

Show that the reasoning of Problems 11 and 12 can be carried out for 
an electrical system, such as a storage battery, by replacing the volume 
V by the charge passed through the battery and the pressure p by the 
voltage. 

13. Show that except for an additive constant the entropy of a system, 

defined as J M/T = k I BdQ, is 

§ = * In W = (1/T)<S> + k In Z = -kB 2 — (— \ ■ 

Show, using the relationship between BdQ and the derivatives of Z in 
Problem 12, that the term on the extreme right of the last equation is a 
valid expression for the entropy even if T is so low and 8 so large that the 
approximation of Problem 5 is invalid. Hence show that the zero of 
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entropy may conveniently be chosen so that at zero temperature the 
entropy is 

§0 = k In JV(&min), 

where JF(S m in) is the number of quantum states of minimum energy for 
the system. (Is the entropy equal to zero at T = for a hydrogen atom ?) 
14. Using the above definition of § in terms of Z, show that 

- (1/0) In Z = <S> -TJsJ, 

where 7 is the quantity frequently called the free energy. Show the re- 
lationship of the definition of entropy of Problem 13 to the equation 



(: 



dTjr 



15. Show that the mean square deviation of energy of a system in equi- 
librium with a heat reservoir is <(S — (S)) 2 ) = kTc v T where c v is the 
specific heat of the system at constant volume. Hint: use 

Cv = = —KB 

V dT d$ 

and express all quantities in terms of Z. 



CHAPTER 17 

THE THEORY OF TRANSITION PROBABILITIES 
FOR HOLES AND ELECTRONS 

17.1 INTRODUCTION 

This chapter covers the most involved material presented in this book, 
as may be judged from its length and the fact that all of its sections, save 
this introduction, are interdependent. In it Schroedinger's equation for 
a system of atoms and electrons is developed into a form suitable for 
interpretation in the language of transition probabilities. The argument 
falls into three main parts: (1) finding a set of wave functions suitable for 
describing a system consisting of an electron moving through a crystal 
whose atoms are vibrating, (2) finding how this wave function changes with 
time, and (3) interpreting the results in terms of transition probabilities. 

It is not practical to follow the argument in this sequence, however, 
without making it seem unnecessarily abstract. It is pedagogically far 
preferable to treat first (3) and part of (2) and thus to show in general 
terms how solutions of Schroedinger's equation lead directly to transition 
probabilities and how the matrix elements obtained from the starting wave 
functions fit into this scheme. Accordingly transition probabilities are 
treated first in Section 17.2. The treatment makes use of the Weisskopf- 
Wigner method of dealing with transition probabilities, which has not to 
the author's knowledge been applied previously to the conductivity 
problem. It has been employed here, in spite of its complexity compared 
with simpler methods, in order to justify the details of the simple picture 
of transition probabilities used in Chapters 8 and 11. 

On the basis of Section 17.2, it is seen that the transition probabilities 
depend upon the matrix elements for an electron-atomic vibration collision. 
These matrix elements are evaluated by carrying out part (1) in Sections 
17.3 to 17.5. In terms of the wave functions so obtained, the matrix 
element is derived in Section 17.6. 

Figure 17.1 has been prepared to show the logical connection between 
various parts of the treatment. As in the case of other figures of the same 
sort, it will probably be of most value in reviewing the argument rather 
than in foreseeing it. 

Section 17.7 is concerned with showing the equivalence of the problem 
of one hole to that of one electron and then deals with the case in which 
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many holes and electrons are present — one of the problems discussed in 
Section 11.2. 

The transitions produced by impurity scattering are treated in Chapter 
11 and are not enlarged upon in this chapter. It should be noted, however, 
that the procedures of Section 17.2 would apply equally well to the matrix 
elements for impurity scattering. It is also specifically pointed out in 
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Fig. 17-1 — Logical Structure of Chapter 17. 

(1) Is concerned with finding suitable wave functions; 

(2) With finding how the wave functions change; 

(3) With interpreting the results. 



Section 17.7 that the scattering process for holes is substantially the same 
as that for electrons independent of the scattering mechanism. 

The following list of symbols used extensively in this chapter is repeated 
here for convenience. Equation numbers are in parenthesis and section 
numbers are not. 

r T = position of the center of gravity of the crystal plus the excess 

electron. 17.4. 
r = position of the excess electron in respect to the center of the 

crystal. 17.4. 
m e = mass of the electron (used to distinguish this from m below). 

17.4. 
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m = reduced mass of electron plus crystal (used in this meaning 
only in Chapter 17). 17.4. 
M c = mass of crystal. (1) 17.3. 
Mt = mass of crystal plus electron. 17.4. 
M=\M C = mass associated with a normal mode. (11) 17.3. 
N s = number of atoms in the crystal. 17.3. 
§ = wave function for entire system of crystal plus excess electron. 

(5) 17.5. 
X = factor of $ corresponding to r^. 17.4. 
■ty = factor of § corresponding to r. 17.4. 
<j>(«'s) = factor of § corresponding normal modes excluding motion of 
center of gravity. (22) 17.5 and (5) 17.6. 
<Pi = v(»ka, n-kai qhac, qkccs) = factor of <{>(«'s). 17.5 and (4) 17.6. 
G&J, k, a = polarization unit vector, wave vector, polarization index. 17.3. 

Some additional symbols such as q, A and x(*) are defined and used in 
Section 17.7. 

17.2 THE THEORY OF TRANSITION PROBABILITIES; 
MEAN FREE PATH IN TERMS OF MATRIX ELEMENTS 

In this section we shall derive the formula for the mean free path in 
terms of the general transition theory for a quantum-mechanical system. 
In this treatment the formula for the matrix elements, which determine 
the transition probabilities, is introduced as an added assumption, the 
derivation of the expression being given in Section 17.6 for atomic 
vibrations. 

The system considered consists of an array of atoms, which are held 
together by valence electrons, plus one excess electron in the conduction 
band. Except for the excess electron, the electrons are regarded simply 
as furnishing a medium which binds the nuclei elastically to their equi- 
librium positions; the nuclei can vibrate according to quantum-mechanical 
laws about those positions (discussed in Sections 17.3 and 17.5) and their 
vibrations interact with the excess electron so as to cause it to make transi- 
tions from one quantum state to another. From any given state of the 
system, in which the electron has momentum Pq, transitions may occur to 
states in which it has momenta Pj; each change of this sort may occur in 
two ways, depending on whether the electron gains or loses energy to the 
lattice .vibrations. It is found to be possible to treat these two possible 
transitions as a single transition by a simple means [Section 17.6, equation 
(33)]. For this reason we may consider the end states, to which a transi- 
tion is made from a state with momentum P , as uniquely specified by the 
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final momentum Pj. We may, therefore, describe the initial state of the 
system by § and the final states by *y, where the * symbols are functions 
of the coordinates of the nuclei and of the position and spin of the excess 
electron. 

We shall assume that the wave functions * ; are eigenfunctions of a Hamil- 
tonian UC which differs by a small perturbation term, we shall call it 6 lli, 
from the exact Hamiltonian 3C. This assumption, which is a general 
requirement of perturbation theory, will be justified by the developments 
in later sections. Accordingly, we may write 

" DC = X + %! (1) 

and 

3C *i = S,-* y . (2) 

If the system is initially in the state § at / = 0, the presence of ^i will 
cause transitions to other states $y, and our problem is to find the proba- 
bility that the system is in any particular group of states as a function 
of time. This is done by obtaining a solution to the time-dependent 
Schroedinger equation for 3C by expressing § as a series, 

where the «'s are functions of time and the initial state at / = corresponds 
to 

<*i — &jo> that is, «o = lj #;Vo = 0. 

The probability of finding the system in any group of states at a later 
time is 

£|<z 7 | 2 , summed over states of interest. 

Before considering the mathematics of the perturbation theory, we shall 
show the relationship of this problem to the treatments of Chapters 8 and 1 1 . 

On the Basic Equivalence of the Quantum and the Classical Models. During 
a collision an electron gains or loses energy to the lattice vibrations and it is 
this process which converts the electrical work I 2 R done on the electrons 
into heat vibrations. In any individual collision, however, the loss or gain 
of energy by the electron is negligible compared with its total energy, and 
we may neglect such changes while studying the scattering of the electrons. 
Thus, we suppose, as shown in Figure 17.2, that an electron initially in 
state Pq may make transitions to other states of the same energy, such as 
Pj; and from these it can make transitions to other states, and so on. 

We shall see later, under the heading Quantum Theory of Transition 
Probabilities, that the perturbation interaction introduces uncertainty in 
value of the energy after the transition. As a consequence of this, the end 
states do not lie exactly on the sphere of constant energy but instead are 
distributed chiefly in a thin shell, represented in Figure 17.3, with a spread 
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in energy given by ASt = h where To is lifetime of the initial state, dis- 
cussed below. This uncertainty in energy is so small, however, that we 
may neglect its effects for most purposes and consider strict conservation 
of electronic energy to apply. 



— -&(Pj)=S(P ) 



PROBABILITY OF TRANSITION 




PER UNIT TIME IS W jdrtj 



Fig. 17-2 — An Electron Initial in State Po of the Brillouin Zone Makes Transitions 
to Other States of the Same Energy According to the Classical Model. 

In accordance with implication (a) of assumption 1 of Chapter 8, the 
probability of transition from any one state to any other state should be 
independent of time. We shall now give this assumption mathematical 
expression in the form of equations, whose consequences (although not the 
equations themselves) we shall later show are equivalent to those of quan- 
tum-mechanical perturbation theory. The equivalence of these two treat- 
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Fig. 17-3 — The Relationships Between Element of Volume, Number of End States, 
and Velocity. (Discussed chiefly on page 489.) 



ments justifies the use of the classical picture of collisions and mean free 
times in visualizing processes and making calculations. 

In keeping with the procedure of Chapter 11, we consider in Figure 17.2 
a thin shell in momentum space bounded by two closely neighboring sur- 
faces of constant energy. Since changes in energy are neglected, we may 
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treat this shell as a surface and speak of the number of states per unit area 
or number of electrons per unit area. Thus we consider transitions from 
the initial state P to the group of states near Pj lying in the element of 
surface area dQj where dQj has the dimensions of (momentum) 2 . The 
probability of transition per unit time will be proportional to the area and 
may be written 

W*j&kj. (3) 

The total probability per unit time of making a transition from state Po 
is then 

W - I W jdQj, £2(S) = energy surface. (4) 

t/a(s) 

We similarly denote the probabilities of making transitions from state 
Pj by 



Wu and W, 



n 



■j = f WnXk. (5) 

t/a(S) 



We shall next use these probabilities to describe the statistics of the 
scattering process. 

It is helpful to think of the probabilities in terms of numbers of elec- 
trons ; this can be done by supposing that a large number N of identical 
systems start with an electron in P$ a.t t = 0. Accordingly we shall use 
the following definitions 

c*o = number of electrons still in Pq at time /, 
@j = number of electrons per unit area (of the 

equal energy surface of Figure 17.2) near 

Pj which have been scattered once, 
7i = number of electrons per unit area near Pj 

which have been scattered twice, 
etc. 

The number of electrons which have been scattered once and are in a 
range dQj and the total number ft which have been scattered once are 



Pjdttj and / fadQ 
«/a(S) 



o(s) 

The classification of electrons in terms of the number of collisions they have 
had simplifies the comparison with the quantum-mechanical theory. 
From these definitions and the assumption that the Wji are independent of 
time, it is evident that the rate of loss of electrons from a is Wqcio, giving 

4<* = ~#W «o = e-™, Wo = 1/to (6) 

at 



I 
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where To is the mean free time between collisions of Chapter 11. The rate 
of change of the singly scattered electrons in dUj is 

- PjdSlj = aoJVoid&j - Wfijd&j 
at 



h = #w«o - Wfii\ (7) 



and similarly 



ji ^JfVjiPidSlj - fT/Yi. 



These equations express mathematically the idea that an electron in a 
given state Pj has definite probabilities of being scattered per unit time 
into various other states P i} these probabilities being independent of time. 
This is in agreement with the picture used in Chapter 11 to calculate the 
mobility. Our main objectives in this chapter are (1) to show that 
quantum mechanics leads to the behavior predicted by these equations, 
thus showing that the simple physical picture used in Chapters 8 and 11 is 
correct, and (2) to evaluate the probabilities W. We shall treat (1) in 
this section, verifying in addition that the form of the quantities Wa is 
consistent with the principle of detailed balancing discussed in Chapter 11. 
The expressions for the Ws in terms of the constants of the material will be 
derived in Section 17.6. 

The Quantum Theory of Transition Probabilities. We shall next present 
a quantum-mechanical derivation of the collision formulae discussed 
above. The method employed is based on the Weisskopf and Wigner 1 
treatment of radiation broadening. In their method an approximate solu- 
tion is obtained for the decay of an initial excited state of an atom into the 
ground state with the radiation of a quantum of light. In our problem, we 
consider an electron in an initial state which may make a transition to 
another momentum state with radiation of a phonon. In the radiation 
case, the final state is specified by giving the characteristics of the emitted 
photon. In our case we can use instead the final momentum since the 
phonon emitted or absorbed is uniquely described in terms of the change 
in momentum. (The slight oversimplification of this statement is recon- 
sidered in Section 17.6 following Table 17.2.) The electron-scattering 
problem is more complicated than the radiation problem, because, imme- 
diately after one transition occurs, others are possible. It is possible to 
show for the cases of principal interest, however, that these other transi- 
tions may be treated according to the scheme of equations (6) and (7). 

1 V. Weisskopf and E. Wigner, Zeits.fur Phys. 63, 54 (1930); 65, 18 (1930). See also 
W. Heitler, The Quantum Theory of Radiation, Oxford University Press, New York, 1936. 
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We shall give a demonstration of some of these relationships in this section, 
leaving certain questions of rigor as problems at the end of Section 17.6. 

The Weisskopf-Wigner method uses considerably more advanced tech- 
niques than those usually employed in deriving transition probabilities. 
The more conventional treatments lead only to an initial rate of decay 
which is linear in time and thus represent the first two terms, namely, 
1 — (t/r), of equation (6). The Weisskopf-Wigner method leads to the 
proper exponential dependence. Both methods lead to the same formulae 
for t, and the reader who is unfamiliar with the more elementary method 
may find it advantageous to study some of the standard texts. 2 

The Weisskopf-Wigner method starts by expanding the wave function 
in a series of the form 3 

* = a e- w #o + E V**** (8) 

where the coefficients are unknown functions of time satisfying the con- 
ditions 

«o = 1, bj = at / = (9) 

and 

7ko = S , huj = &j. (10) 

The $y functions differ from $o by having one more or less phonon than 
$o in our case an( i ln the radiation emission case by having one more 
photon. Thus § is not one °f the i's in the sum of #y. We shall neglect 
in the first treatment the possibility of transitions from one #/ to another 
or to other functions in which more changes in the phonon field have 
occurred. With the aid of these assumptions the differential equation for 
the coefficient a is obtained from the time-dependent equation, 

3C* = (3C + %)# = (ih)$, (11) 

by multiplying by §o* an d integrating over all coordinates and by using 
DC #o = So*o: 

iha = 'Z'% )J -e*--'% (12) 

where ^oy is the matrix element for transition from toj: 

% ; . = [§0*^$^ (all coordinates). (13) 

The matrix elements of ^i, as follows from the treatment of later sections, 
are Hermitian and in addition ^ooj which is omitted in (12), is shown to 
vanish in Section 17.6. [As the text just preceding equation (28) of this 
section indicates, the perturbation due to %! is small compared to thermal 

2 Such as L. I. Schiff, Quantum Mechanics, McGraw-Hill Book Co., New York, 1949, 
Section 29. His equation (29.12) is equivalent to our equation (23) for W^. 
* E' represents a sura over all states omitting j = 0. 
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energies for electrons.] Similar manipulations on (11) with #** give 

iUj = %&*•«— #ao. (14) 

We shall next verify that an approximate solution of these equations is 
«o = e"^' (IS) 

fill ■„ ftW»;'— «o)— »1' 1 

h i = ~r~ ~i ^ — ( 16 ) 

in t{uj — too; — V 

where t\ is a constant which we shall shortly evaluate. 

In terms of i\ the probability that the electron is in state zero is |«o| 2 = 
exp ( — Irjt) from which it is evident that the transition probability and 
mean free time for state §o are 

W — It] and To = 1/2)7. 

_ Equation (16) above for bj follows at once from that for « and the dif- 
ferential equation (14). In order to satisfy the equation for «o we must 
have 

LGlf .Gil . 1 -[*'(«o —»»)+<)]< 
— ^2 v s C 17 ) 

h t(uj — w ) — v 

as may be found by direct substitution. (As we shall show below, this 
equation can be solved for r\ giving a constant value independent of t.) 
The. summation is next replaced by an integration, it being assumed 
(justified in Section 17.6) that G U ;' G ll,'o = | G lXo y 1 2 is a slowly varying func- 
tion of the end state j. The density of end states is (V/h 6 ) since changes 
in spin do not occur. For integration coordinates we use elements of area 
dtoj of the constant energy surfaces and the energy itself. The element of 
volume in P space is thus 

dV P = dUjgiP^dS, 
where 

g(Pj) = 1/|VpS(P)| = 1/py, VJ = \ Vj \ 

is the separation of the energy surfaces per unit energy difference. 4 (As 
discussed above, we neglect the energy of the phonon.) For brevity we 
shall introduce pj (a function of Pj) which gives the number of end states 
ds per unit energy per unit surface; by definition we then have 

V V 

Pjdiljd& = -3 — d£ljd6>, pj =■ Tg — • (18) 

it Vj ft Vj 

This relationship, together with certain other features of the integration 
over P-space, is shown in Figure 17.3 on page 485. 

The integration may be carried out approximately making use of known 

* See Section 11.2 for a further discussion of dVp. 
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integrals. It is convenient to replace d§> by hdwj, which then brings 
equation (17) for jj into the form 

|SW .12 i g[t(u>o— <•>/) +i]( 



'=/ 



h i(d0j — co ) — t] 



PjdwjdQj. (19) 



The factor involving a>o — coy is largest for transitions which conserve 
energy, and for these its value is (1 — e vt )/ri; for large differences between 
coy and coo it is usually regarded as negligible. If we consider the important 
contributions to come from so small a range of coy near co that the other 
factors in (19) are constant, we can make use of the equations 

P+ L 1 

► »«/—! 






s. 



~[i((i)Q—b)j)-\-7l\t 



dcoj = 


l+T, 


17>0 
r?<0 


ioij = ' 


\-2t, 

i o, 


T? > 

17 < 0. 



£,_>«, «/-l ;(coy — co ) 

(See Problems for a discussion of the integrals and convergence questions, 
and with their aid carry out the integration over coy.) This procedure 
reduces equation (19) to 

Jr> |g]i .|2 
L-^ TPjd&i (20) 

where the integration is over the surface <§y = &o. Thus for all values of 
/ equation (17) is satisfied by the same value of r? so that (15) and (16) 
constitute a good solution for the time-dependent Schroedinger equa- 
tion (11). 

We shall next show that the differential contributions to r\ can be inter- 
preted as transition probabilities per unit area of the energy surface. To 
do this we consider the probability that the electron be found in the element 
of area dQj with some unspecified energy Tkoy. This probability is the sum 
of the appropriate |^y| 2 and is 

, /Via ,, , rl%-ol 2 i + e- 2 "' - 2e-"' cos (coy - o,„)/ 

dilj I \bj\ pjhduj = dQj J — 2 2 dwjPi 

J J n (coy — coo) + rf 

1 i gi r • I 2 

= ^d-e- 2 " ( )^^2 1 rpy^y (21) 

where we have once more taken the other variables as constant over the 
important range of integration for coy and have allowed the limits to extend 
to ± oo ; the necessary integrals are standard forms. 5 As is shown in 
Figure 17.3, the important range of integration is that for which A<S = 
|<Sy — <§o| = h*l corresponding to |coy — co | = rj. This result is consistent 

5 B. O. Pierce, A Short Table of Intends, Third Edition, formulae (480) and (490), Ginn 
and Co., New York, 1929. 
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with the uncertainty principle in the form A&At = h and with the inter- 
pretation At = 1/rj = 2to where To, as we shall show below, is the lifetime 
of the state § - For cases of practical interest, A& is much less than § 
so that the spread in energy is relatively unimportant; this point is also 
discussed shortly. 

If in the classical equations we suppress second collisions, so as to have a 
situation corresponding to the quantum-mechanical case just treated, we 
find the fraction of the electrons which have been singly scattered by 
solving the following simplified form of (7) 

Pi = ^V*o = w oj e- w *, 

thus obtaining 

fc = ±r (1 - e- w *Woi (22) 

which is equivalent to (21) if we make the identifications 

^o; = ^2x Pi (23) 

1/to = fF = 2, = f W oj JQi. (24) 

Thus the behavior in time for the Weisskopf-Wigner approximate solution 
leads to the classical behavior with the interpretation that transition 
probability per unit area of the constant energy surface is given by (23) 
and the mean free time t is given by (24). 

The Principle of Detailed Balancing. The form of equation (23) allows 
us to verify that the principle of detailed balancing holds for the collisions 
considered here. Thus if we consider a thin energy shell of the sort dis- 
cussed in Chapter 11, Figure 11.1, we can show that under equilibrium 
conditions the number of electrons making transitions from an element of 
volume d&dtti to d&dQj is just equal to the number making transitions from 
d&dQj back to d&dtti. The reasoning given in Section 11.2 is based on the 
equality of PiWa and pjWji. This equality follows at once from equation 
(23) since we can write 

PiWa = p,-|%-i| a 2xpy/ft = PiPi^ji^nlv/h = PiWa. (25) 

The third expression, which is symmetrical in /' and j, follows from the 
Hermitian property of ^i which leads to %.,• = ^j*. 

The modifications required by the Pauli exclusion principle are discussed 
in Section 17.7 and also in Chapter 11 in which the result (25) is used. 

Critique of the Quantum-mechanical Treatment. The derivation of (23) 
and (24) suffers from the flaw that second collisions were arbitrarily pre- 
vented. It is possible to extend the theory so as to include these but at the 
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expense of additional questions of convergence. We shall outline the 
method here and give some additional details in a problem, leaving the 
actual manipulations as an exercise to the reader. 

We have found that any given state, described by a , makes a transition 
to states bj so that a varies as exp ( — r)t). Thus the effect of interaction 
with b states causes a to vary according to the equation 

iha = ihria . 

This suggests that, in dealing with the decay of any one state, we can 
replace the effect of the states which it excites and to which it decays by 
writing —ihria in place of the sum of ^oA exp [*'(w — uj)t), which occurs 
in the equation for iha [that is, (12)]. If this simple procedure is legiti- 
mate, we may represent the effect of the states to which the bjij series 
may decay by writing 

ttij = %-oe' ( "'~"°"«o - ih-mh, (26) 

the rij term being obtained in just the same way as rj except that the 
initial state is §j instead of § - If this approximation is made, another 
solution of the Weisskopf-Wigner type 6 can be obtained for the coefficients bj 

b > = ~ir - v ^ ;— (27) 

tk t(wj - w ) - V + Vi 

and it can be shown that these bj once more lead to the classical probabilities 
when a new set of transition probabilities Wj k are defined for the transitions 
from ij to other states. 

This process may be pushed farther on so that the states arising from 
decay of $» are in turn considered, etc. In this way the analogy between 
the quantum theory and the classical collision theory may be made as 
complete as may be desired. 

There are, however, some annoying questions about the validity of this 
procedure which we endeavor to dispose of in Problem 4 at the end of 
Section 17.6. 

One point which we may discuss here is the validity of replacing l^oy^Py 
by a constant during the integration over wj. The important range of 
integration is that for which \uj — u \ = rj = 1/2t . This corresponds to a 
change in electronic energy of S& = h/2T . For an electron in germanium at 
room temperature, t = 10~ 12 sec so that 5& = 10 -15 erg or about kT/50. 
Hence the variation of energy over the important range is only a few per 
cent of the energy of the electron and, consequently, the changes in | 6 U j| 2 Pi' 
may be neglected. (See, however, Problem 2, for other questions of 
integration.) The criterion that the important range of /koy be small 

In fact very similar to the one they used in considering two successive radiation processes. 
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compared to kT has been discussed recently by Seitz 7 who expresses the 
requirement in the form n > 30 cm 2 /volt sec. 

The Calculation of the Mean Free Time and Mean Free Path. As we shall 
show in Section 17.6, the value of the matrix element can be written 8 

|%o| 2 = AT IV (28) 

where T is the absolute temperature, V the volume of the crystal, and A is a 
constant depending on the elastic properties of the crystal, the mass of the 
atoms, and certain attributes of the wave functions but not upon 7*o or Pj- 
The formula states that | G LL j0 [ 2 is independent of the change in direction of 
the electron's motion, so that, after being scattered, the electron is equally 
likely to have momentum in any direction. Since the values of Wqj do 
not depend on the direction of Pj, the integration over dtlj in equation (24) 
in this case thus reduces simply to multiplication by 4ttP 2 . Using the 
formula vq = \P \/m n , which relates electron velocity Vq to momentum and 
effective mass m n , we may evaluate (24) as follows: 



- = f fF oj dQj = ^ Pj ]% \H wPo * = ^r^I1^4xPo 2 
T t/(as.) h h \_h v _\ V 



= I rr ojuiij = — pj | -ujoi tttx o — — 1 Ta — I ~T7~ ■«" " 
To 

ATm n 2 



IT, 



i 



v = v /l{T). (29) 



This expression shows that the transition probability is proportional to v$, 
corresponding to a mean free path 1{T) independent of velocity and given 
by the formula 

«""^?- (30) 

It is perhaps worth while to point out that the constant mean free path 
independent of energy of the electron results directly from the fact that the 
matrix element is independent of the electron's initial and final momenta. 
This independence of mean free path upon energy will apply even if the 
bands are degenerate, provided only that the energy in any direction of 
P-space varies as P 2 . If this last is true, then the area of the constant 
energy surfaces will also vary as P 2 or as the energy & and the spacing 
between surfaces will vary as 5>~ **. Consequently, the transition prob- 
ability will vary as 6 X &~ ** or S>^ and this is proportional to velocity. 
Hence even if the energy surfaces are quite complex, the mean free path for 
a given direction of motion will be independent of velocity. 

As shown in Chapter 11, the expression for constant mean free path leads 

7 F. Seitz, Phys. Rev. 73, 549-564 (1948). 

8 This effective value includes the contributions both of processes involving gain and of 
processes involving loss of energy by the electron. 
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to a mobility 

M = 4//3 V2rrkT/m n . (31) 

The treatments of atomic vibrations and their interactions with the electrons 
in the following parts of this chapter establish the expression for I^jjI 2 , 
show that it has the form AT/V, and evaluate the constant A. 

The equivalence of the scattering of holes to the scattering of electrons 
and related questions are discussed in Section 17.7. 

Problems 

1. Evaluate the integrals used in equation (19). This may be done by 
contour integration in the complex plane. The first integral is elementary 
and the second is obtainable from the contour from — R to +R on the real 
axis plus the semi-circle below the axis. (See also Whittaker and Watson, 
Modern Analysis, Fourth Edition, p. 123, problem 15. Cambridge at the 
University Press, 1927.) 

2. Both integrals considered in connection with (19) are improper in- 
tegrals and their values depend upon how the limits ± oo for uj are ap- 
proached. The exponential term will converge unless j ^Uojj 2 increases 
more rapidly than |co — w,-|. However, the other term may give an 
imaginary component of the form 

e— >0«/-» t/uo+e (C0j — 03q) 

Compare this with the second-order perturbation energy [see, for example, 
Scruff, loc. cit., equation (25.12)] and show that its effect is to correct the 
frequency of the state § to account for the average energy of the system 
as a whole. 

3. Substitute bj of equation (27) into (12); show that (12) is satisfied 
and v given by (20). Calculate the value of 0, for these b/s and show that 
the expression which replaces (21) is the correct solution for equation (7) 
of the collision theory provided the interpretation Wi = 2rn is used. 

4. If §(/) is a solution of Schroedinger's equation (when magnetic fields 
are absent), then $*(—/), which runs backwards, is also a solution. Yet 
equation (12), a = eT vt , obviously cannot be run backwards indefinitely. 
Show that, att = when all bi = 0, d = and not — »j and hence that the 
Weisskopf-Wigner approximation must break down at / = 0. To what 
solution for negative / will the W-W approximation actually correspond ? 
Why is it that the W-W solution can be reversed in time while those of the 
collision equations (6) and (7) cannot? 

17.3 THE NORMAL MODES FOR ATOMIC VIBRATION 

As outlined in Figure 17.1, several steps are required in calculating the 
matrix element for a transition involving changes of electronic and vibra- 
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tional state of the crystal. In this section we initiate this treatment with 
an analysis of the motions of the nuclei, regarding the valence electrons 
simply as a medium for exerting internuclear forces. The justification 
for this procedure is based on the high ratio of nuclear mass to electron 
mass. As a consequence of this difference, the bonding electrons move so 
much more rapidly than the nuclei that their wave functions are the same 
as if the nuclei were momentarily stationary. The energies of the wave 
functions of the bonding electrons depend on the nuclear coordinates, 
however, and as the nuclei move these energies change. The work done 
by nuclei on the electrons as the nuclei move represents an effective poten- 
tial energy for the nuclei and can be introduced as such in the Schroedinger 
equation for the nuclear coordinates. In this case the electrons can be 
thought of as moving so rapidly that the nuclei are affected only by the 
average force exerted by the electrons and this is given correctly by the 
use of the electronic energy as an effective potential energy. 1 

Accordingly we consider a crystal containing N s atoms. We assume 
that the crystal structure has one atom per unit cell so that all atoms have 
the same mass and are otherwise equivalent. We shall assume that peri- 
odic boundary conditions apply so that we may think of atoms located 
at x = A x — a, where A x is the boundary-period in the ^-direction and a, 
the lattice constant, as being neighbors of atoms at x = 0. If the crystal 
as a whole undergoes translational motion, atoms crossing the x = A x 
plane may be thought of as simultaneously re-entering at x = 0, no 
momentum being involved in the transfer back. Alternatively, we may 
imagine that the crystal as a whole moves outside of the block from 
x = to x = Ax by considering that the potential energy between any 
two nuclei may be represented as a periodic function of their coordinates 
Xiyi%i and Xiy 2 z 2 w hich satisfies the equation 

°0(a?! - x 2 ,yi - y 2 , zi - z 2 ) 

= T) {xi - x 2 + n x A x , yi - y 2 + n y A y , z x - z 2 + n z A z ) (1) 

where A x , A y , A z are the periods and the n's are integers. These two 
possibilities are indicated in Figure 17.4(b) in which the atoms of the crys- 
tal and their periodic "images" are shown. The atoms a and b outside 
the boundary are equivalent to their periodically related images a and b 
inside the boundary. 2 

A system of this sort can exert torques on itself so that the angular 

1 Detailed treatments of the separation of nuclear and electronic coordinates are given in 
many texts such as F. Seitz, The Modern Theory of Solids, Chap. XIV, McGraw-Hill Book Co., 
New York, 1940; H. Eyring, J. Walter, and G. E. Kimball, Quantum Chemistry, Chap. XI, 
John Wiley & Sons, New York, 1944. See also R. P. Feynman, Phys. Rev. 56, 340-343 (1939). 

2 For a discussion of the effect of various boundary conditions see M. Born, Proc. Phys. 
Soc. Lond. 54, 362-376 (1943), 
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momenta are not constants of motion. Thus if the crystal as a whole is 
rotated slightly as shown in Figure 17.4(c), the "images" produced out- 
side the boundary by the periodic boundary conditions are seen to exert 
forces on the atoms inside and these forces tend to rotate the crystal back 
to parallelism with the periodic boundaries. This feature of the model is 
somewhat artificial; one can imagine, however, a two-dimensional crystal 
in which these boundary conditions would apply by following the line of 
reasoning given in Section 15.9. Furthermore, the general nature of the 
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Fig. 17- 

(a) Normal arrangement. 

(b) Translated arrangement. 

(c) Rotated arrangement showing how the crystal can exert a torque on itself. 

results obtained with the periodic model would almost certainly be repro- 
duced, although less conveniently, by using other more physical conditions. 
The Normal Modes. Our problem in this section is to obtain a solution 
of the Schroedinger equation for the motion of the nuclei. In this treat- 
ment we shall use three sets of vectors, if,, if/ and 8 if,-, to describe the 
positions of the nuclei, the choice being dictated by convenience for the 
later development. These if's are defined as follows: A certain reference 
position in which the nuclei are perfectly arranged on a periodic lattice is 
selected for convenience. In this arrangement one of the atoms, called 
zero, is selected as the origin. The vector positions of the other atoms 
constitute the set if,- so that if, is a translation of the lattice. The vector 
i x A x , being an integral number of lattice constants, lies on an extended 
point of the if, lattice. In the disturbed arrangement of the lattice, atom 
* moves by an amount 8if, so that 

8if, = Ri — Ri (2) 

or, in words, 

displacement from reference position 

= actual position minus reference position. 
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In terms of the position vectors R/ and the conjugate momenta P„ 
Schroedinger's equation for the nuclear motion is 



\£iV ,+ ' B <*'- ••■*»■'>]* = 5 * 



(3) 



where Pi = (h/i)VRi' and the subscript indicates the coordinates of the 
nucleus or atom are involved. For small amplitudes of vibration, 13 may 
be expanded as a quadratic form and a set of normal modes obtained 
having coordinates and momenta qt and pi. For periodic boundary con- 
ditions these modes can be expressed in terms of running waves like those 
of Figure 5.6; for a three-dimensional crystal, however, the modes can 
propagate in space and, furthermore, each atom may vibrate in three 
dimensions. In order to arrive at these running wave solutions, we shall 
take two preliminary steps. 

We consider first the classical problem of finding normal modes. For 
this case it is advantageous to use the set of quantities Si?,- which represent 
the deviation of R/ from the reference equilibrium position of the crystal 
for which the atomic positions are Ri. The classical equation of motion, 
derived from Hamilton's equations, or from F = ma, are 

M a (cP/dt 2 )hRi = -V/0(8A's) (4) 

where Vi implies taking the gradient in respect to 8J?* and where °D is now 
approximated as a quadratic form in the $R's. The normal modes for such 
structures, taking into account the effect of the periodic structure of the 
crystal upon the form °0(8/?'s) are treated in detail in many texts, the 
general methods being analogous to those employed in other parts of the 
chapter. 3 These normal modes are required to have the property that 
all atoms undergo simple harmonic oscillations with the same frequency 
and phase [except for reversals of direction corresponding to phase differ- 
ences of 180°, as shown in Figure 5.3(j) to (n)]. We shall denote the 
amplitudes of the normal modes by the normal coordinates q. In terms 
of the normal modes the displacements of the atoms are found to be 

hRi = £ G^q^ cos ft • Ri + q kae sin ft • Ri) (5) 

k,a 

where each normal mode coordinate undergoes simple harmonic motion 
independent of the others. The wave vector ft is required to satisfy the 
periodic boundary conditions so that 

k x A x = 2vn x , kyJy = 2irn y , k z A z = 2ir» 2 , (6) 

'Sections 14.7 and 15.3 for analogous problems; for textbooks see, for example, F. Seitz, 
The Modern Theory of Solids, Section 22, McGraw-Hill Book Co., New York, 1940; L. Bril- 
louin, Wave Propagation in Periodic Structures, McGraw-Hill Book Co., New York, 1946; 
and A. H. Wilson, The Theory of Metals, Cambridge at the University Press, 1936, Chap. VI. 
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where the w's are integers. For each wave vector ft, there are three 
directions of polarization, corresponding to values 1, 2, 3 for subscript a; 
these three directions of polarization are specified by the unit polarization 
vectors G^. In an elastically isotropic crystal, one of the three G's for 
a given ft is parallel to ft and represents a longitudinal wave while the other 
two G's are perpendicular to ft and to each other. For a cubic crystal, 
as is shown in Problem 1, the three G's are perpendicular although the 
property of being purely transverse and purely longitudinal is not pre- 
served. For each ft and a, there are two patterns of vibration one vary- 
ing as cos ft ■ r with normal coordinate g^ and one as sin ft • r with normal 
coordinate qk as . 

The lattice of ft-values described by these equations is simply related 
to the basic lattice of P vectors in the Brillouin zone [see equations (9) to 
(11) section 5.5] by the equation 

hk = P. (7) 

The distinct values of ft are also limited to the Brillouin zone and we shall 
refer to the limited set of ft's as the basic set of ft's. The relationship 
between the ft and/* values plays an essential role later in connection with 
electron transitions for which changes in P excite lattice waves whose 
ft-values are given by the changes in P/h. 

So far as the standing waves described by cos ft • Ri and sin ft • i?; are 
concerned, it is evident that ft and —ft give rise to exactly the same dis- 
placements; hence, in the sum over the possible values of ft we shall re- 
quire that opposite values are excluded. (This can be done by excluding 
all negative n x ; for n x = excluding negative «„; and for n x = n v = 
excluding negative n z . See Problem 1 for questions of counting.) It is 
also evident that equivalence relationships exist for ft-values on the sur- 
face of the Brillouin zone in the same way as they do for P-values [see 
Figure 5.11(a)]. For a simple cubic crystal, the edge of the Brillouin 
zone for ft corresponds to the value ir/a where a is the lattice constant for 
\k x \, \k y \ and \k z \. For each value of ft, there are three possible directions 
for the unit vector G^ distinguished by values 1, 2, and 3 for a. The 
frequencies of vibration of the normal modes are determined by ft and by 
the direction of polarization and are denoted by co^, the same frequency 
being obtained for both the sine and cosine modes. We shall not derive 
these results nor the other consequences, which we shall quote, of the 
analysis of the normal modes, but will instead outline the method of 
derivation in a set of problems at the end of this section. 

Three normal modes are of special interest; they correspond to ft ■= 0, 
and to w , a = 0. For one of them we may take Goi as i x , the unit vector 
along the x-axis. The sine term vanishes for ft = 0, and the cosine term 
gives a displacement 8i?; = i x qo,i corresponding simply to translation of 
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the crystal. These three modes represent motion of the center of gravity 
of the crystal and may be expressed in the form 

Re = Lqoi + «!/?02 + Uqo3 ( 8 ) 

where R c = Z 8fli/W, summed over all nuclei since the displacements 
due to each mode with k j± satisfy the equation £ 8#; = °- 

The entire normal mode system corresponds to a transformation of 
coordinates of the form 

Xi = Y.*Wj 

where * stands for a nuclear coordinate, such as the j-component of 8i?, 
and qj for one of the normal modes. From the general nature of the nor- 
mal mode problem, it follows that the Hamiltonian function in terms of 
the normal modes is expressed in the form 

where M e = N,M a = the mass of the crystal; (10) 

M = Mo/2; ( n > 

p c =LP, ( 12 ) 

i 

The mass of each nucleus should have added to it the mass of its core and 
valence electrons which follow it in its motion; thus M c will really be the 
mass of the entire crystal. The vector P c is the momentum of the crystal 
as a whole and is conjugate to R c . The sum of the other terms extends 
over all q ka c and q ka s for which k ^ 0. 

For future use we shall write the Hamiltonian in the abbreviated form 

^C = ^ 7 Pc 2 + i:^i ( 13 ) 

2M C 

where *' stands for any of the ka pairs for the standing waves so that 

Xi = db ^ + pi * + M2wi2 {q " + q " )] ' (14) 

We shall henceforth refer to the q's as the normal modes for the standing 
waves, or q, td for short, and to R c as the center of gravity of the crystal. 

The transformation to the q's and R c is a contact transformation in 
both the classical and the quantum-mechanical sense (see Problems 2 and 
3) so that the eigenvalue and wave function problem is equivalent to that 
of a free particle corresponding to R c and to 3(iV 8 - 1) independent 
harmonic oscillators. 

The following three problems amplify the reasoning behind these 

results. 
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Problem 1. This problem constitutes a proof that the waves of equation 
(5) are normal modes for a simple cubic lattice in which each atom is at a 
center of symmetry for the undeformed state. This simplified case will 
serve to illustrate the general features of the normal mode problem. (For 
more general crystals see references in footnote 3 of this section.) 

Suppose that for the equilibrium reference position, one atom is at the 
origin and the other atoms are located at positions specified by lattice 
vectors /?,-. The displacement of atom / from its reference position J?,- is 

(*.-, yi, z,-) = hRi. 

The atom whose reference position is 2?, + R s has a displacement which 
we may write as 

( x i+s, yi+n Zi+s) = 8#j+s. 

For small displacements of the atoms the force F t on atom / will be a linear 
function of the displacements. Show that its x-component Fi X has the 
form: 

d 2 F d 2 F d 2 V 1 



-Fix = Z, x i+ , — — h y i+s — — h Zi 

a L dXidXi+a dXidy i+s 



dxidx i+s ' J '" r * dXidy i+s ' ~ l+ ' dx t dz :+s 
d 2 V , , d 2 V 



= L (Xi+s - X{) j^ 1- (y i+s - v,0 



dXidXj + , dXidy i+s 

Mf ^ d 2 F 1 

+ (2 !+s - Zi) -— f 

dXidz i+s J 
the second equality following from 

xiL — h yiL — r- ZiL — = 0; 

s dXidx i+s s dXidy i+a s dXidz. i+s 

an equation which states that translation of the crystal as a whole produces 
no restoring forces. 

Show that the wave motion 

Hit = Ge ihS<+iat = Ge ie < 

satisfies the equations of motion, provided that 

-MJii = u 2 M a G x e iBi = -F ix 

= e ie <G x Z(e ik - R ' - 1) ~^f- + e^G,Z(e ift * - 1) d ' F 



dXidXi +s B dXidyi+s 

+ e«'G,i;(e tt *-l)-4^- 

s oXiaZi^. s 

= e i9i [M XX G X + M xy G u + M M GJ; 

and similar equations for G y and G z are satisfied. From the periodicity of 
the crystal lattice and the assumption that each atom is at a center of 



17.3] 



PROBLEMS ON NORMAL MODES 501 



symmetry, show that 

d 2 V B 2 V d 2 V 



etc., 



dXidy i+s dXidyi- s dyidx i+3 
and hence that the quantities 

M X y = My X , M XZ = M ZX , My Z = M Z y 

are real functions of ft. Thus the equations of motion reduce to 
w 2 M a G x = M XX G X + M xy G y + M XZ G Z 

03 2 M a Gy = My X G X + MyyGy + My Z G Z 

w 2 M a G z = M ZX G X + M zy G y + M ZZ G Z . 
These equations are equivalent to the symbolic equation 

co 2 M a G = MG 

where M is the matrix having the M's as components. This matrix is 
Hermitian since M xy = M yx and M yx = M yx * because the M's are real. 
The eigenvalues u 2 M of this Hermitian form are all positive, as may be 
shown by evaluating the potential energy. Consider the displacement 

hRi = Real part qGe ik ' Ri . (A) 

This produces a force on atom ; for which the ^-component, for example, is 
F ix = - Real part of e 1 '*-^ [M XX G X + M xy G y + M XZ G Z ] 
= — q cos k ■ Ri [M XX G X + M xv G y + M XZ G Z \ 

The energy required to increase q by dq is 

-Y.Fi ■ d{iRi) = (Z cos 2 k ■ Ri)qdq X \M XX G 2 + M xy G x G v + M xl G x G z 

+ My X GyG X + MyyGy" + M yfi y G Z + M ZX G Z G X + M Z yG Z Gy + M ZZ G f\ 

so that the stored energy due to (A) is [except for h = and k = 
(ir/«, w/a, ir /a)] 

q 2 iN,(G\M\G) 

where (G|Af|G) is the quadratic form of the G's. In proving this result 
obtain also for later use the relationships 

E cos (ft • Ri) cos (ft' • Ri) = L sin (ft • Ri) sin (ft' • *,■) 

= {NJ2)h,v 

unless ft = or ft = (ir/a, ir/a, ir/a), for which cases the value is twice as 
large for the cosine sum, while the sine sum vanishes. Show also that 
terms in which mixed sines and cosines occur all vanish. 
Since the equilibrium position (all q — 0) is assumed to be minimum 
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energy, it is thus evident that (G|M|G) is a positive definite quadratic 
form so that the eigenvalues w 2 are positive. Since the coefficients in M 
are real, the eigenvectors (G x , G y , G z ) can be chosen to be real. 

Prove from the properties of the Hermitian form that the three eigen- 
vectors G, denoted by G a , or G\, G 2 , G 3 , are mutually perpendicular. 
From the same properties prove that for the displacement 

BRi = Gx (qu cos k ■ Ri + q u sin k ■ R { ) + G 2 (q 2c cos k ■ Ri + q 2s sin k • Ri) 

+ G 3 (q 3c cos k ■ Ri + q- is sin k ■ R { ) 
the stored energy is 

\N s Ma W{qU 2 + qu 2 ) + « 2 2 ( ?2c 2 + ?2 S 2 ) + «3 2 ( ? 3c 2 + ?3 S 2 )] 

and that the kinetic energy is 

\N s M a [qu + 4u + he 2 + bs 2 + q 3 c 2 + q 3s 2 ]. 

Prove that no interaction terms arise between displacements coming 
from different values of k of the basic set, equation (6). 

Show that for k — the sine terms should be omitted and that o = 
so that appropriate coordinates are q i, q 02 , qos giving 

SRi = i x q i + iyq 02 + i z q 03 

where the i's are unit vectors along the cartesian axes. 

Count the number of normal modes produced by this process and show 
that including the motion of the center of gravity of the crystal there are 
just 3N S normal modes corresponding to the three degrees of freedom 
per atom. [Consider especially the differences produced by odd and even 
numbers of atoms along the edges of the crystal and note the equivalence 
of points on the surface of the Brillouin zone; see Figures 5.11(a) and 
5.6(d).] 

This shows that the 3iV s coordinates of the nuclei can be expressed in 
terms of the 3A^ 3 normal modes with the kinetic and potential energy terms 
becoming 

K.E. = ^ ($oi 2 + g 02 2 + m 2 ) + % £ all g 2 

M 

P-E- = IT Z I>fa* 2 (?fe« 2 + qkJ) 

where M c = N s M a is the mass of the crystal and M = M c /2. 

Problem 2. Show that Schroedinger's equation in the normal mode 
coordinates is correctly obtained by replacing the q terms by p/M and 
using (h/i)d/dq for p. (An outline of the proof is as follows:) 

It is, of course, a basic assumption that for the nuclear coordinates 
themselves Pi = (h/i)V Ri . It must be proved that the transformation 
from the R/s to the #'s leads to the appropriate form for the kinetic energy 



17.3] 



PROBLEMS ON NORMAL MODES 503 



operator for the q's. This can be proved by showing that 

Prove this in more general terms by showing that if a real linear transfor- 
mation A from one set of coordinates qi to another set Q, has the form 

Qi = E*ij?j 

and preserves the diagonal form of the kinetic energy 

JLMiQi 2 = 2>tfA 
» i 

then the transformation B given by 



£,-y = VMi/mj an 
has the property 

i 

so that, since the % are real, B is unitary. (See Problem 3.) From the 
unitary property it follows that 

i 
From this prove that 

which shows that Schroedinger's equation takes its proper form in the 
normal mode coordinates. 

Prove that the classical momenta P { = M l Q i and p, = mfa transform 
according to 

Pi = "Lcijpj where ca = (Mi/m^a^ = VMi/mj b^ 
i 

and that this transformation is a contact transformation since 

EJVft = HPidqj. 

i 3 

Show that the operators (h/i)d/dQi and (h/i)d/dqj also transform the 
same way, that is, 

h d _ %d_ 

Show that, if a set of operators pi, q t which satisfy the commutation 
relations 

(ptqj - qjpi) = (,i/h)dij 
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are transformed according to 

Pi = Hcijpj and Qi = IXtfj, 

i 3 

then the operators Pi and Qi satisfy similar commutation relations. 

Problem 3. Establish the properties of a unitary transformation of n 
variables Xj to n variables jy,-. Let the transformation A be 

yi = ^LaijXj, or symbolically, y = Ax (A) 

i 

where all quantities may be complex. If the transformation satisfies the 
requirement 

"Lyi*yi = I>A;> 

i j 

it is said to be unitary. From this show that 

YL a i]* a ik — &jk 
i 

so that the reciprocal transformation A~ 1 has the matrix 
{A~ r )n = an* where A~~ x y = x. 

Show that, if A is a unitary transformation, so is A~ * and hence that the 
aij's also satisfy 

Tl a ji* a ki — bjk- 
i 

17.4 THE INTERACTION BETWEEN ATOMIC MOTION 
AND ONE EXCESS ELECTRON 

In this section we shall consider the effect of adding one extra electron 
to the crystal over and above those necessary to complete the valence 
bonds. We require that its wave function satisfy periodic boundary con- 
ditions with the same boundary-periods A x , A y , A z that apply to the nuclei. 
The electron interacts with the nuclei and with the valence electrons. 
The state of the latter, we shall suppose in keeping with the last section, is 
determined by the nuclear positions. Thus the interaction energy may be 
taken to be ^Ifo, 8i?'s) where r e is the vector position of the electron and 
the SR's are the nuclear coordinates, equation (2) of Section 17.3. Since 
our model is periodic, % must satisfy the equation 

GU(r e + A, BR's) = Gltfe SR's) (1) 

where A is any vector whose components are integral multiples of the 
boundary-periods A x , A y , A z . In terms of the normal coordinate q's and 
the position of the center of gravity of the crystal, % may be written as 

qi = %(r„ - R c , q's) = sUoto - R e ) + G U 1 (r e - R c , q's) (2) 
as may be seen as follows. If the q's have certain values, corresponding to 
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a deformation of the lattice which leaves the center of gravity of the nuclei 
fixed so that R c = 0, then the potential energy will be ^(r^ q's). If 
R c ^ 0, the effect is to shift the entire potential field by R c ; this is taken 
account of by replacing r e by r e — R c . The ^o term corresponds to the 
perfect lattice and does not depend on the q's. The ^i term vanishes 
when the ^'s are all zero and represents the effect of deformations ; it is 
the term which causes transitions. In making calculations we shall con- 
sider only the terms in ^i which are linear in the q's. (Since the electron 
at position r e is probably not at a minimum potential energy, linear terms 
will be present.) 

This form for the interaction of the electron with the lattice leads to the 
Hamiltonian 

^ = ^T7 p * + £3C< + T- Pe 2 + ^ (r. - *«, q's) (3) 

2M C i 2m e 

where the subscript e refers to the actual, not effective, mass of the electron. 
Since R c now occurs in DC, P c no longer commutes with !JC and is not a 
constant of motion. However, since r e and R c occur only in the form of 
the difference vector r = r e — R c , it is possible to replace r e and R c by two 
new coordinates and P c and p e by two new momenta according to the 
following transformations: 

PT=P e +Pe, P = Jf T P.-jt T *. W 

rT = w; re + it T Rc > r - r --*- - (5) 

M T = M c + m e . (6) 

In these equations pr and ry are evidently the momentum and the position 
of the center of gravity of the (T for) total system of electron plus crystal. 
As the reader may verify, this reduced mass transformation brings DC to 
the form 

DC = (1/2M t )Pt 2 + LDCi + (l/2m)p 2 + ^(r, q's) (7) 

i 

where m is the reduced mass: 

m = m e M c /(m e + M c ) = m e M c /M T . (8) 

The inverse transformation is 

p. = (m e /M T )p T +P Pc= {M c /M t )Pt ~ P (9) 

r e = r T + (M c /M T )r R c = r T - (m e /M T )r. (10) 

The reader should verify for the ^-components, for example, of this trans- 
formation that the conditions of Problem 2 of Section 17.3 are satisfied so 
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that the operators (h/i)V T and (V')V rr ma Y be substituted for p and p T - 
The expression for "X may advantageously be written as the sum of 
four terms 

3C = DC rr + 3C r + E30 + silifoYs) (11) 

i 

where the DC,- terms correspond to the normal modes, equation (14) of 
Section 17.3, and the new Hamiltonian expressions are 

DC rr = (\/2M t )Pt 2 (12) 

DC, = (l/2m)p 2 + %(r). (13) 

Except for the term ^i, each term involves only one set of conjugate 
coordinates and momenta. For such cases the coordinates are said to be 
separated and the solutions for the equation (DC - %)§ = S# can be 
obtained in terms of products of eigenfunctions for 'DC rT , < X r and the various 
"DCi, a procedure which forms the basis of the calculation of the matrix 
elements for transitions given in Section 17.6. We shall deal in this section 
with the wave functions for r T and r, leaving the q's for Section 17.5 and 
the entire wave function § for Section 17.6. 

We shall thus consider a wave function of the form \(r T )^(r) which 
satisfies 

(DC rr + DC r ) X (T)v|/(r) = S X (r r )v|r(r) 

and the proper boundary conditions. 

In accordance with the customary method of separating variables, we 
divide this equation by x*|> 5 noting that °X rT does not operate on r in 
\}f nor does DC r -operate on r T in x. we obtain 

(X rX /x) + CWIO = & 
From which we conclude that both terms on the left are constants (inde- 
pendent of r and r T ) so that we have 

and 

S' + §" = g. 

It is evident that^r commutes withDC rr so that p T is a constant of motion. 
This leads to 

x = V-* exp (iP T ■ r T /h), &' = P T 2 /2M T (14) 

where V = A x A y A z is a normalization factor so that the integral of 
X*XdF rT over the volume is unity. 

The equation for \|/ is identical with that of an electron moving in a 
fixed potential field 6 U ('') except that m occurs in place of m e . Since 
m = m e M c /(M c + m e ) = m e /[\ + (mJM e )], 
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the difference between m and m e is negligible. The solutions of this equa- 
tion may then be taken to be the customary Bloch functions v|/p(r) where 

X+P(r) = HP)W) ( 15 ) 

^(r) = r-x exp (iP ■ r/ft)u,(r) (16) 

where u P {r) is normalized to unit volume and has the periodicity of the 
undistorted lattice. 

The boundary conditions for Tt and r are deduced from those of the 
electron and the nuclei. If any of these particles is advanced by one of 
the periods A x , A y , A z , then the wave function must be unaltered since 
the new configuration is equivalent to the old. 

The boundary condition for x(r T ) is found as follows: If all of the parti- 
cles are simultaneously advanced by one of the periods A x , A y , A„ the 
same change takes place in Tt, but no changes occur in r or in the ^'s. 
It is evident, therefore, that x( r r) must satisfy the periodic boundary 
conditions and consequently Pt must be a vector of the form 

P T = h[i x (2wn x /A x ) + i y (2irn y /A y ) + i,(2im,/A t )] (17) 

where the n's are integers. These vectors lead to periodic behavior for x 
and include the basic lattice of the Brillouin zone. However, their values 
are not restricted to the interior of the zone, and the energy P T 2 /2Mr 
increases indefinitely with increasing \Pt\- 

We shall have other occasions to deal with vectors of the form (17) 
which are not restricted by the Brillouin zone boundary. We shall refer 
to them as vectors of the extended P-lattice. We shall similarly refer to 
vectors 1/ft times as large as vectors of the extended k-lattice. 

To find the factor associated with increasing r by one of the periods 
A x , A y , A„ of the boundary conditions, we consider the effect of advancing 
the electron alone by i x A x . This must have no effect on the wave function. 
However, the change of i x A x in r e and zero in R c is equivalent [equation 
(5)] to changing 

r T by i x A x (m e /M T ) and r by i x A x . (17a) 

These change x^ by the factor 

exp [,■ (P T ■ i x A x ^ t +P. i x A x )/h] = exp \i (^P T + p) ■ i x A x /h_ ■ 

From this and the similar equations for A v and A z , we conclude that the 
vector in parenthesis must be a vector of the extended P-lattice, that is, 
of form (17). Thus we may write 

P =P e - {m e IM T )P T (18) 

where P e is a vector of the extended lattice. It will be important in later 
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calculations that the difference in P for constant values of Pt are vectors 
of the extended lattice, even though the P's themselves are not. (The Pt 
term corresponds to correcting P e for the motion of the center of gravity to 
obtain P for the relative motion; see Problem 1.) 

In Section 8.6 we shall prove that a selection rule holds for transitions 
due to ^i. This rule resembles closely the requirement of conservation of 
momentum, the change in P for the electron being compensated by a change 
in a similar quantity for the lattice vibrations. Actually conservation of 
momentum is not concerned at all since the transitions do not involve Tt 
or Pt, which represent the total momentum of the system. The conserva- 
tion law is fundamentally associated with the existence of a set of operators 
£ (Rj), which we shall treat in the following paragraphs. £ (Rj) commutes 
with 3C and, consequently, its eigenvalue is a constant of motion. It is 
really to the eigenvalue scheme of the £(Rj) operators that the conserva- 
tion law applies. 

The Lattice Displacement Operator} We shall, therefore, consider a 
group of operators which produce displacements not by the periods A x , A y , 
A z of the boundary conditions but instead by the vectors Rj of the crystal 
lattice. The operator £(Rj) has the effect of advancing the entire wave 
function by Rj in the following sense: the wave function £(Rj)$ has the 
same value as $(R C , r e , q's) when the electron coordinate in £(Rj)$ is 
r e + Rj and the nuclear coordinates in £(Rj)§ have values which produce 
the same deformation as those for § except that the deformation pattern 
(but not the nuclei) is advanced by Rj. It is evident from the periodic 
properties of "D and % that if * is an eigenfunction of 3C, so is £(Rj)i. 

It may be proven generally that £(Rj) commutes with 3C and the eigen- 
values of £ may also be determined. For this purpose we express § in 
terms of a set of "deformation coordinates" S» 

Si = 8i?j — R c = R{ — Ri — R c - 

Thus the S's represent the deformation which takes place in addition to 
simple translation of the crystal. The wave function may then be written 
as $(R C , Si, r e ) where the N, + 1 variables R c and S< are linear functions 
of the iV a nuclear vectors R/. The effect of £(Rj) upon § is evidently to 
replace the variable Si in § by the variable Si where 

Ri = Ri + Rj 

and to replace r e by r e — Rj. If the deformation pattern and the electron 
are advanced by Rj, then Si will take on the values formerly associated 
with St; and r e — Rj will take on its former value r e so that 

[£(i?y)#] (shifted pattern) = * (unshifted pattern). 

1 The treatment of this operator, which was suggested to the author by C. Herring, puts the 
selection rules dealt with in Section 17.6 on a more fundamental basis. It is not, however, 
an essential step in obtaining the formulae for mean free path. 
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This equation means that the wave function denoted by £(Rj)$ takes on 
the same values as $ when the pattern of nuclei and the electron are ad- 
vanced by Rj. 

The kinetic energy operator commutes with£(/?y) since it has the effect 
of taking V 2 in respect to all the coordinate positions in $ and it makes no 
difference whether this is done before or after the coordinates are changed 
by £(Rj). Since the values of the potentials °D and % are unchanged by 
changing the coordinates by £(Rj),£(Rj) commutes with them and hence 
with DC for the entire system. 

The operators £ (I?,-) obviously satisfy the equations 

£(*,-)£(*,■) = £(R,)£(Ri) = £(Rt + Rj) 
and 

£(i x J x ) =£(iyA y ) = £{i z A z ) = 1 

since the last operations merely move the electron by one of the boundary 
periods. Consequently, if the eigenvalues of £{i x a), £(i y a) and £(Ua), 
where a is the lattice constant of a cubic structure, are written in the form 2 

exp (-iP £x a/h), exp (-iP £v a/h), exp (-iP £y a/h), 

then the eigenvalue of ■£(/?»•) must be 

exp ( — iPs • Rifty 

and, consequently, P £ must be a vector of the extended lattice, equation 
(17). In this case, however, since only translations by lattice vectors are 
involved, the distinct values of P s are contained in the Brillouin zone. 

In the next section we shall see how the running waves for the nuclear 
vibrations fit into theP £ scheme and in Section 17.6 how the conservation 
of P s leads to the selection rules for transitions. 

Problems 

1. Show that when the ^i term is neglected, the wave function may be 
written in the form 

exp [i(P c0 Rc+P e - r.)/ft]*(r, q's) 

where P c0 andP e are vectors of the set hk and *(r, q's) has the periodicity 
of the lattice in r. What is the relationship of P c0 and P e to the vectors 
P T and P discussed in the text ? 

2. Show that if one of the nuclei is a heavy isotope, the operator £ (R t ) 
does not commute with the nuclear kinetic energy. (The fact that £ 
does not commute with DC does not indicate a serious failure of the selection 
rule of Section 17.6 since the latter depends on a linear approximation for 

2 The minus signs are introduced as a convention to simplify an interpretation made at the 
end of Section 17.S. 
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''111, which should be valid at low temperatures independent of the isotopic 
content of the crystal.) 

3. Show that an operator £ (/?,•) exists for a crystal like NaCl which has 
two different atoms per unit cell. 

4. Consider the operators £-(R{) from the group viewpoint. Why are 
their irreducible representations one-dimensional? 

17.S TRANSFORMATION TO RUNNING WAVES 

The coordinates q introduced in Section 17.3 correspond to normal modes 
or standing waves. By a contact transformation a new set of coordinates 
can be obtained which correspond to running waves. The required trans- 
formation has the property of mixing the p's and q's belonging to one of the 
sine and cosine pairs for particular values of the wave vector k and direction 
of polarization a. The Hamiltonian function 3Cj for these two degrees of 
freedom, which we denote simply by q c and q a , omitting k and a, is 

3Q = \p 2 + p 2 + M 2 » 2 {q 2 + q 2 )]/2M. (1) 

The transformation from the old ^'s and p's, referred to as the standing 
wave set or q st d, to the running set, referred to as the ? run , is given by 

q c = (2Af«)-*(gL. + p+) q t = (2M,)~*(?+ + pJ) 
p c = (Af«/2)«(p_ - q+) p s = (Mo>/2)*(p+ - ? _); (2) 

which transforms the Hamiltonian into the form 

DC; = §co ( P+ 2 + q + 2 + pj + qj). (3) 

From the form of the transformation, or its inverse, 
q+ = (Mu>/2) H q s - (2Mo>r*Pc q- = (Mo>/2)^q c - (2MoY*p. 

p+ = (Mco/2)^ ?c + (2Mw)-*p s p_ = (Mu/2)*q 3 + (2Mo>r H p c , (4) 

it can be shown that, regarded as operators, the new p, q Iun set must 
satisfy the quantum-mechanical commutation rules such as 

p+q+ - q+p+ = h/i, p + q- - q-p + = 0, (5) 

provided that the old p, q s td set does. (The reader should verify this as 
an exercise.) Furthermore, the p, q st d set does satisfy the commutation 
rules in accordance with the results of Problem 2 of Section 17.3. The 
transformation is also a contact transformation in the classical sense since 

p+dq+ + p—dq— — p c dq c + p s dq s + dS (6) 

where dS is an exact differential. (As an exercise, evaluate the function S.) 

The classical motion produced by p+q+ will now be shown to be a running 

wave moving in the +ft-direction while p—q— is a wave running in the 

— ^-direction. On the basis of this result it is natural to describe the new 
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variables, obtained by transforming any pair q^ac and qkaa, not as qka+ 
and qi, a — but instead as qua and q-ka simply by including the negative 
values of the k contained in the sums over the q Bt d set. (It will be recalled 
that of the possible fe-values in Brillouin zone which satisfy the periodic 
boundary conditions, only half were used with the q s a set since —k gave 
the same standing waves as +ft.) This will extend the sums in the q mn 
system over all fc-values in the Brillouin zone and to each ka pair there 
will correspond only one coordinate. 

The classical motion corresponding to p+ and q+ is obtained by solving 
the Hamiltonian equations 

q + = d"X/dp + = ap + and p+ = — d'3C/dq + — — aq + (7) 

which lead to 

q+ = A + cos (at + 0+) (8a) 

p + = -A+ sin (at + 6 + ). (8b) 

When this is introduced into the expression for 8i?„ omitting k and a sub- 
scripts, we obtain 

hRi = G[cos (k ■ Ri)q + sin (k ■ Rt)q a ] (9a) 

= A + (2Ma)^G sin [k ■ R { -at- 0+], (9b) 

which is a wave moving in the +&-direction with phase velocity <o/|ft|. 

It may similarly be shown that the p—q~ solution represents a wave 
running in the — ft-direction. The proof is left as an exercise for the reader. 

We shall next show how q + and y_ partake of the aspects of running 
waves in the quantum-mechanical sense. For that purpose we shall 
consider in a preliminary way how the interaction of the electron with the 
nuclear vibrations is calculated. As discussed in Section 17.4, the energy 
of the electron with the deformed lattice depends on r, the position of the 
electron in respect to the center of gravity of the nuclei, and on the de- 
formation expressed by the variables y at d. We expand this interaction in 
a power series in the q's and assume that the deformation is small enough 
so that we may retain only the linear terms. This leads to the equations 

^(r, ?.td) = %>(r) + G U 1 (r, ?std ) (10) 

where the perturbation term ''Ui is 

%(r, ft M ) = Efataetf'M/dftec) + flte.^/dflta.)], (11) 

ka 

the partial derivatives being evaluated at zero deformation. For a given 
ka pair, the values of q c and q„ specify a sinusoidal distortion with 8i? 2 - 
given by (9a). In the next section we shall consider how the interaction 
of the electron with such a sinusoidal disturbance may be evaluated. In 
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this section we shall consider certain general aspects of the matrix elements 
for the p's and q's and for 8/?,-. 

Although we wish to think of the deformation in terms of the q at d, it is 
advantageous to use the quantum numbers for the running waves to de- 
scribe the nuclear vibrations. Specifically we shall write the part of the 
wave function involving the q B and q c (corresponding to a particular ka) 
in the form <)>(«+,«_; q c , q s ) and require it to be an eigenfunction in- 
dividually for DC + and X_ [with eigenvalues fua(n + + i) and ftw(«_ + i) 
respectively] as well as for the total energy 3C; = 3C+ + 3C _ of two modes. 
By expressing q + and p + in terms of q Bt d> we obtain 

3C+ = &{p+ 2 + q + 2 ) = $Xi + h>(Ps<lc - Pcq s ) (12) 

and 

3C_ = iu(pj + qJ) = $Ki - &(p s q c - p c q s ). (13) 

Since 3C + and 3C_ are the Hamiltonian operators for harmonic oscillators, 
we must have 

X + <j>(«+, »_; q c , q s ) = foo(»+ + i)<K"+> "-'■> 1o, q,) 

X_<J>(«+, »_; q c , q s ) = M«- + i)<K«+, w -5 ?« ?«)> (I 4 ) 

so that the eigenvalue of 3Cj is 

Si = M«+ + w_+l) = M»i+ !)• (I 5 ) 

For a given value of §», there are «»• + 1 ways of dividing the energy 
between n+ and «_. 

The matrix element of iRi for transitions in n + and «_ is linear in q c 
and q s and hence in p+, q + , p-, and ?_. The matrix element for p + and q + 
vanish if k_ changes, since p + and ?+ commute with 3C_, and if «_ does not 
change, they vanish unless n + changes by ±1. For such changes they 
have values given by the customary formulae: 1 

(»+, n_\p + or q+\n+, «_) 

= J J <j>*(«+, »_; q c , q s )[p+ or ?+ ]<j>(» + ', »_; q c , q s )dq c dq s (16) 

1 See P. A. M. Dirac, Quantum Mechanics, Oxford at the Clarendon Press, 1930 or 1935, for 
a treatment of the harmonic oscillator. The phase factors exp (iy) in equation (34), 88, of 
the first edition enter the same way in both the p + and q + matrix elements and do not affect 
the results derived later; they may also be eliminated by proper choices of the phases of the 
representation. 
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The formulae for p_ and ?_ are the same and can be obtained by inter- 
changing the subscripts (leaving the differences due to ±1 as they are). 
The matrix element for hRi in terms of these matrix elements is found by 
expressing the matrix elements for q c and q s in terms of (17) and (18) and 
inserting the results in (9a) ; the value is 

{n + , n.\m\n + ± 1, «_) = G JH \ V ^± + ~ l ex P <'* ' *> (19) 

The quantity M, defined in Section 17.3, equation (11), and at the end of 
Problem 1, is one half the mass of the crystal. The similar expression of a 
transition in »_ is found to be 



(*^|8ftk,nu ± 1) = G J^\ iV ^^ { -l k ; Ri) (20) 

\4Mo [-iVK_ exp (tk ■ Ri). 

In both of these expressions the real and imaginary parts correspond to the 
q c and q s terms individually. 

Since these matrix elements depend only on the quantum number which 
changes, we may express them as functions of this number and describe it 
simply as n ka corresponding to q ka . Both elements then take the form 

(n ka \BRi\n ka + bn ha ) 

= G ka [h(2n ka + 1 + tataVSMwfcJ* exp (iSn ka k ■ R { ) (21) 

except for a factor ±» which may be disregarded since only the absolute 
value of (21) is needed for calculating the transition probabilities. Unless 
\5n ka \ = 1, the matrix element vanishes. 

The general wave function which describes the nuclear vibrations and is 
an eigenfunction for all the 3C run with quantum numbers n ka can then be 
written as a product 

<K«*a's) = n <!>("**<*,>«-*,•«,•; qkictjo qkiaja) = n <}>,, (22) 

there being as many terms in the product as there are distinct ki<xj pairs 
of values in the set of q s n. The last expression is an abbreviated form used 
in the next section. There will be matrix elements of the q 3i d, and con- 
sequently of hR{, only for transitions in which one and only one n k changes 
by ±1. 

The Effect 2 of the Operators £(/?«) on $. The fundamental reason why 
the eigenvalues n ka for the running waves are suitable is that in terms of 
them and the functions typ and \p T the entire wave function § becomes an 
eigenfunction of £(R a ). The meaning of this statement is that the wave 
function § , obtained by the operation £(R f )i, is simply L s § where L s is 

2 As for the section on the operators £{R S ) in Section 17.4, it is not necessary to use this 
method to calculate the mean free path. 



514 THE THEORY OF TRANSITION PROBABILITIES [17.5 

the eigenvalue (shown below to be a number with absolute value unity). 
The wave function £(R e )§ = $' was defined in Section 17.4 as follows: 
Consider the value of $ for some set of values, denoted symbolically by 
Q, of the coordinates of the entire system; then shift the deformation 
pattern of the nuclei and the position of the electron by R a and call the 
new set of values of the coordinates Q' . Evidently Q' is a function of Q 
and R s and Q has the same functional dependence on Q' and — R s ; these 
relationships may be written as 

Q' = 2.(Q,R a ) and Q = £{Q',-R a ). (23) 

(We shall later add subscripts to a when dealing with particular coordi- 
nates such as r and r T .) The definition of £(R a )§ is then 

[£(*.)*]G2') - *'(0') = *(0) (24a) 

so that 

*'((?') = *[a(0', -*.)]. (24b) 

Showing that § is an eigenfunction of «£(i? s ) reduces to showing that 

*'(<?') = *[2(0', -*.)] = £.*(0') (24c) 

i.e. that §' of any set of values of the coordinates, Q' being such an arbitrary 
set, is simply L a times § of the same set of values. 

In order to establish the above result, we must determine the functional 
relationship 2. for the coordinates Tt, r and the q's. The relationships for 
r and It are simply 

r' = a, (r, R.) = r + R s ; 

%(/, -R s ) = r' - R a (25a) 

r T ' = % T {r T , Rs) = r T + (m e /M T )R s ; 

% T (r T ', -R s ) = r T ' - (m e /M T )R e (25b) 

as may be seen from equation (5) of Section 17.4 and the fact that the shift 
of the pattern of nuclear distortion does not alter £ BR so that R c is 
unchanged. 

In order to deal with the effect of the shift of R s upon the <?'s we consider 
one ka pair, denoted by q e and q s the subscripts ka being omitted for 
brevity. We introduce a new pair of coordinates: 

P = (qc 2 + q?) V \ 6 = tan" 1 (q./q.) (26a) 

q c = p cos 8 q„ = p sin 6. (26b) 

In terms of these coordinates, the shift R s of the deformation pattern 
has a very simple effect which we shall now derive, the result being given 
in (32a). The coordinates themselves amount to expressing the two har- 
monic oscillators q c and q, as a two-dimensional isotropic oscillator. For 
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such an oscillator, the angular momentum operator (h/i)d/dd commutes 
with DC and has eigenvalues varying from hn to — nh in steps of 2h where 
n prescribes the total energy in the form hw(n + 1). Without referring 
to this interpretation of 0, however, we see that purely formally we have 

i'd0~ i Idd dq c + 38 3g~J ~ 7 L q ° dq~ c + q " ~3q~J 

= q c ps — q s pc- (27) 

From this we see that equation (12) becomes 

3C+ = pd + (ho>/2i)(d/dd) (28) 

and consequently that the wave function <)>(«+, n~; q c q s ) is an eigenfunc- 
tion of d/38 with eigenvalue /(» + — «_). We may, therefore, write 

- <)>(»+, »_; q c , q„) = /(»+ - «_)<)>(»+, »_; y c , ?,) (29) 

so that 

<j>(«+, »-; ? c > ?«) = exp /(» + — w_)0 X function of p. (30) 

Expressed in terms of p and 8, the contribution of ka to 8/?; may be 
rewritten as follows: 

bRi = G ka [cos k ■ R{ p cos + sin ft • Ri p sin 0] 

= C^pcoslk-Ri -0]. (31) 

Thus the effect of the "shift" R s upon the pattern is to change to d' = 
+ ft • /?„ so that the value of 8/? formerly obtained at Ri is now obtained 
at Rj + R s . Accordingly we may write that 

6' = 2,(0, «.) = 8 + ft • R s ; %{&', -R.) =8' - k-R 8 (32a) 

p' = 2 p (p, R.) = p; 2 p (p', -*.) = p\ (32b) 

Similar relationships will hold for all values of ft and a. 

In terms of the properties of Xp T (r T ), typ(r) defined by equations (14) 
and (16) of Section 17.4 and those of the <]>'s established in (30), the value 
of 

§ = Xpi4H>0 v/ *«' s ) (33) 

is readily evaluated for the set of coordinate values 2(<2', — R s ) as re- 
quired in equation (24c). In the evaluation each factor of § gives a term 
of the form exp ( — iP- R,/h) or exp [ — /(»+ — «_)ft • R s ]. The terms 
(n + — «_)ft may be rewritten as n k k + »_*( — ft), so that the eigenvalue 
L„ defined in Section 17.3 as exp (—iP £ ■ R a /h), becomes 

L s = exp { — iPs. ■ R»/h) 

= exp -i[P + (mJM T )P T + KLn^k] ■ R./h (34) 
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where the sum extends over all ka pairs, including ft and -ft as discussed 
early in this section. 

The vector P + (m e /M T )PT is a vector of the extended P-lattice, as 
discussed in connection with equation (18) of Section 17.4, and so is the 
sum of the hk vectors according to equation (7) of Section 17.3. Con- 
sequently, i> £ is a vector of the extended P-lattice, in agreement with the 
conclusion reached at the end of Section 17.4. 

The minus sign in L s arises from the fact that we have dealt with this 
problem from the viewpoint of advancing the wave function by R s rather 

than from the viewpoint of changing the set of values of the coordinates 

from Q to SL(Q, R s ) in one and the same wave function. The effect of 

the latter change is readily shown to be 

*[2L(Q, R,)] = exp [+iP £ ■ *./»]*(£?)• ( 35 ) 

Thus P s as defined in this section and the last plays a role similar to 

P, P T and hn ha k so far as its sign is concerned. 

Since both P £ and Pt are constants of motion for the complete Hamil- 

tonian DC, the vector P + ftl> te fc must be conserved during transitions 

produced by %, even if the effect of nonlinear terms in the q's are included. 

// is this conservation law for P £ that underlies the selection rules derived in 

the next section. 

17.6 CALCULATION OF THE MATRIX ELEMENT 

As discussed in Section 17.2, the transition probabilities arise from 
matrix elements of DC between different states of the system. The states 
are approximately eigenfunctions of VC and are exact eigenfunctions for 
an DC which differs only slightly from DC. In the case which we shall 
treat, the wave functions considered are eigenfunctions of 

DC = DC - sUx (1) 

where ^i is the term discussed in Section 17.4, equation (2), which gives 
the added interaction of the excess electron due to the deformation of the 
lattice described by the ^'s. 

The eigenfunctions * for DC may be formed from products of wave 
functions, already described, for r T , r and the q c , q„ pairs for various values 
of ka. These individual wave functions for r T and r, derived in (14) and 
(16) of Section 17.4, are 

XPr(T) = ^" K exp ( ip T ■ rr/K) ( 2 ) 

,j, p ( r ) = v~ Yi exp (iP • r/h)\i P (r) (3) 

where the factor involving V = A X A V A Z normalizes the wave function over 
the region considered, it being supposed that u P (r) is normalized over unit 
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volume. The normalized wave functions for the q's are written as 

<K»fa» n-kallkac, Ikaa) = $} (4) 

where <j>y is an abbreviation for use later, the index j running over all ka 
pairs of the standing wave scheme. The wave function for the system may 
then be written 

i(P T ,P,n's) =XP T ^P n ^ (5) 

where the variables Tt, r and the q's have been omitted for brevity. The 
Hamiltonian jfCo is 

X = (l/2M T )p T 2 + (l/2m)p 2 + %(r) + Y&i 

i 

= X rr + 3C r + E3C< (6) 

i 

in the notation of Section 17.4, equations (11) to (13). Each term in 3C 
operates on only one of the factors of $ so that the value of the energy S is 

5 = (l/2M T )P T 2 + &(P) + Z>,-[»+* + »-»• + 1], (7) 

i 

the individual terms being those discussed in 17.4 (14) and (IS) for °X TT 
and 3C r and 17.5 (15) for tK - ,-. 

The i's for the entire system form an orthonormal set, the only term 
requiring consideration being \|/j>. The reader may verify that the orthog- 
onality relationship between the vjf^'s is not disturbed by the displacement 
due toPr [see Section 17.4, equation (18)] of the allowed P-values from the 
basic P-lattice of the Brillouin zone. 

3C has, of course, no matrix elements between its orthogonal eigenfunc- 
tions § so that all transitions are a result of G Ui(r, q's). We shall suppose 
that the lattice vibrations are so small in amplitude that ^i can be ex- 
panded in a series in the first powers of the q's as follows: 

%(r,<?'s) = ZqiAd^/dqic) + q is (dm/dq ia ) 

i 

= E?i C %c(r) + qu^uir) (8) 

i 

where the last form expresses the fact that the partial derivatives are 
evaluated for all ^'s = so that they are functions of r only. 

The two terms in this series corresponding to a particular ka pair, 
denoted by i, will have a matrix element between two i's only if: 

(a) all n + jti_j for j t± i do not change ; 

(b) only one w + ,- or w_; changes by ±1. 

If (a) is violated, the term vanishes because of the orthogonality of the two 
<|)/s in the integral. For the <|>» terms themselves, the matrix element of y !c 
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and q it are nonvanishing only for condition (b), the values then being 
those given in equations (17) and (18) of Section 17.5. 

Since r T does not occur in %, P T is constant of motion, as discussed in 

Section 1 7.4, and there are flu matrix elements for which Ft changes. 

We thus need to consider only cases in which P and only one of the »'s 
change. We shall denote the change in n symbolically by «,- and »/ and 
will later consider specific ka pairs in place of /'. For this case the matrix 
element is 

(Pn i \ s U. 1 \P'n/) = f #*%$'</ (all coord.) 

= — / i|/p*(r)<t>*(«; + , w,_; q ic , qi^lqtc^ic + #.%■«] 

X typ>(r)$([n i+ , »,-_]'; qu, q is )dV r dq^dqu (9) 

where the symbol [»; + , »;_]' means that only one of the two n's has changed 
by ± 1 and dV r is the element of volume for r-space. 

Later in this section we shall deal directly with the integral of 6 U,- C and 
^is. The techniques used are somewhat specialized, however, and there 
are certain advantages in treating (9) initially by a more direct method in 
which we consider the effect of one of the q waves as the sum of the indi- 
vidual effects of the displacements 8i?, of single nuclei. We define a 
"single nucleus deformation" as follows: For a given value of r, for the 
deformed crystal, we imagine the electron and all the nuclei save i to be 
held fixed while nucleus i undergoes a displacement 8i?». The change 6 G LL, 
in the potential energy ^i for this "single nucleus deformation" will be a 
function of r, the position of the electron in respect to the center of gravity 
of the deformed crystal, and of hRi, being for small deformations a linear 
function of the three components of the latter. Accordingly we may write 

«% = iRi • W Ri (r) 

to express the change in energy due, we repeat, to displacing nucleus ;' by 
6Ri, holding the position of the electron and all the other nuclei fixed. The 
net effect of all the "single nucleus deformations" is identical with the 
deformation of the q c wave. 1 Thus the value of ^i for the deformation 
of the q c wave can be expressed in two equivalent ways: 

£5% = £8*,- • W Ri {r) = ?e GU„(r). (10) 

% i 

1 It may be noted that the sum of all the 8/?j due to the q c wave does not move the center of 
gravity of the crystal [see 17.3 equation (8) and text], so that although each "single nucleus 
deformation" does produce a small change in R c , and hence in r for a fixed position of the 
electron, the cumulative effect of all the "single nucleus deformations" does not change r. 



17.6] GENERAL EXPRESSION FOR THE MATRIX ELEMENT 519 

From the periodicity of the undeformed lattice it is evident that 

W Ri (r) = W (r- R { ), (11) 

which states that the effect on the potential of moving any nucleus is the 
same as that of moving nucleus zero except for a translation by the lattice 
vector from to R{. Integrals of Wq(t — Rj) occur in the matrix element 
of ^i and, by making use the formula for the \|/p functions and a change in 
variable of integration, we reduce these to simpler forms as follows: 

= /+p*(r' + Rj)W (r')^(r f + R-)dV, 

= e iiP '- p> - R - /n J<\> P *{r')W Q {r')^{r')dV Tl 

= e^-'W/o (12) 

since the integral over r' is equivalent to that over r. 2 

In terms of 7 and the matrix element for 8i?; as given in (21) of Section 
17.5, for a transition in which n ka — > n ka + Sn ka we obtain 

(P,« J %!?',«*„ + «»»„) = zjVs/?, ■ W Rj $'d (all coord.) 

i 

= G ka ■ 7 [A(2« Jta + 1 +8» ta )/8M« J^L exp [i(hdn ka k+P' -P) ■ Rj/h]. (13) 

j 

The sum of the exponential terms vanishes unless 3 

P' + Unjk = P (14) 

so that there are matrix elements only between states having the same 
value for 

P + ftZ»ta* (15) 

just as if a phonon momentum h^,n ka k were combined with an electron 
momentum P in a conservation law. We have seen, however, that Pt 
represents the momentum of the entire system so that this conservation 

" This conclusion depends on the fact that, although in general neither typ nor i|»p< has 
the periodicity of the boundary conditions, iI»p*i|>/>' does. The reader should verify this as 
an exercise. 

3 For the theory of conduction in metals cases occur for which P' ± hk — P is a vector 
whose components are multiples of the edges of the Brillouin zone so that this vector gives 
periodic behavior, that is, it corresponds to 2w times a vector of the reciprocal lattice. Such 
transitions have been called "Umklappprozesse" by R. Peierls, Ann. d. PJiysik 12, 154-168 
(1932). 
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law has another meaning, the one which is given at the ends of Sections 17.4 
and 17.5. 

To summarize: There are matrix elements only for changes in $ which 

(a) do not change Pt ; 

(b) change only one of the n^ and that to n^ a ± 1 = n^ + 5»ft a ; 

(c) change P to P' so that P = P' + Mn ka k. 

For any values of P and k, there will always be an allowed value of P' 
which will satisfy (c) because of the fact that, like hk, differences between 
P and P are vectors of the extended lattice of the Brillouin zone. 

If these conditions are satisfied, all the terms in the sum over Rj are 
equal and add to N s so that, except for a phase factor of absolute value 
unity, we obtain 

(P» fc ,| G Ui|/ ,/ »te + Sn ka ) 

= G ka ■ / (P, *»')#, [ft(2» te + 1 + 5K fta )/8Mo fta ] M (16) 

where it will be recalled iV s is the number of cells in the crystal and 
M = N s M a /2 = M c /2 is where M a is the mass of the atom in each cell. 
The dependence of 7 upon P and P' is indicated. 

Several methods have been employed to evaluate the matrix element 
Gka ■ h based on various approximate treatments of the way the potential 
energy % varies with the displacement of the nuclei. We shall discuss only 
the most recent of these proposed by Bardeen and Shockley, 4 which 
evaluates the matrix element in terms of certain general properties of the 
energy band picture in a way which permits correlation with other experi- 
mental data. 

17.6a. The Method of Deformation Potentials. The essence of the 
method described here is to think of the deformation of the phonon wave 
as producing local variations in the size and shape of the unit cell. Figure 
17.5 shows qualitatively how the band structure may be expected to vary 
with lattice constant for a crystal like silicon or germanium. If the 
deformation waves simply changed the lattice constant, without altering 
the crystal symmetry, then the energy for an electron at the bottom of the 
conduction band would vary as curve & c . If the electron were in a state 
near the bottom of the band its energy, shown by curve 6> C (P), would also 
rise by almost the same amount. We shall shortly consider the effects of 
more general deformations than the pure compression or dilation of 
Figure 17.5. 

There is, unfortunately, a difficulty in applying the slopes of the curves of 

4 W. Shockley and J. Bardeen, Phys. Rev., 77, 407-408 (1950) and a subsequent article 
planned for 1950. In the latter article it is shown that the method of the "effective mass" may- 
be applied to the deformation potentials. Also Sections 11.3b and 12.8. 
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Figure 17.5 to the changes in energy of an electron in different parts of a 
deformed crystal. These have to do with the lack of a precise meaning for 
the zero reference value of the potential G Uo('') in a homogeneously de- 
formed crystal. When curves like those of Figure 17.5 are computed 
numerically, some arbitrary procedure must be adopted. For example, in 
calculating the diamond curves of Figure 5.5, Kimball used a potential 
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Fig. 17-5 — Variation of Energy with Lattice Constant. 



which took on the same numerical value near the interior of the atom for all 
values of the lattice constant and, as he points out, this choice is purely 
arbitrary. 

An absolute meaning can, however, be attached to the curves of Figure 
17.5 in the following way. Start with an undeformed crystal of lattice 
constant a . Imagine a deformation which leaves one portion of the 
crystal unaltered but, as a result of a continuously varying deformation, 
produces another large region in which the crystal is uniformly dilated to a 
value a\. Under these conditions, one might think that the values of the 
energy at the bottom of the conduction band, namely, S c («o) an d S c («i) 
would have definite and unique values. This will not be the case in 
general, however, because accompanying the deformation there will 
usually be electric fields and, although for any particular deformation the 
value of & c (ai) may be uniquely related to S c («q)> another distortion also 
leading to the same value for m, may produce different electric fields, dif- 
ferent average potentials in the dilated region and, consequently, different 
values for & c (ai). These electric fields are hard to estimate but are not 
excluded even for non-polar crystals like diamond. However, the experi- 
mental data on mobility versus temperature in silicon and germanium 
suggests that they are not important in the ordinary range of observation. 
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In a problem at the end of this section we show how the deformations of the 
longitudinal phonon waves produce potentials which vary as A/k 2 where A 
is the dilatation of the ?-wave, but that for covalent crystals this effect is 
small compared to the effect of shifting the bands. (See problem 1 of 
this section.) 

Unlike the individual energies & c and 6> v , the energy gap & g is an absolute 
function of the deformation of the lattice since any variation in the zero of 
potential has simply the effect of shifting all the energies equally. We have 
discussed in Chapter 12 the ways in which the variation in the energy gap 
with lattice constant may be determined experimentally. 

According to the assumption that the long-range electric field effects, 
which make the potential depend on the strain pattern, are negligible, we 
shall consider tb c and 6> v each to be a unique function of the local strain. 
For a cubic crystal up to linear terms in the strain, this leads to the con- 
clusion that 

S c = ScO + £>i„A (17a) 

[abbreviated as & = S + &iA for equations up to (35)] (17a') 

S„ = &vo + £>u>A (17b) 

&q> = &go + SjgA (17c) 

Sig = Si„ — &i P (17d) 

where A is the dilatation 

A = Sxx + s uy + b zz = Vft • 8i?i 

and the Si-quantities are constants, the subscripts p and n being used for 
consistency with later expressions which involve other quantities associated 
with holes and electrons. The other strain components vanish by sym- 
metry in the linear approximation. This may be illustrated by two pure 
shear deformations having z xy > for one equal to — e xy for the other. 
Viewed in the z = plane, these have the effect of changing the angle 
between the x and y axes by ±50 and, by symmetry for a cubic crystal, 
these two changes should produce equal changes in & c . Hence S c must 
be an even function of s xy and the linear term in s xy must vanish. 

In the remainder of this section we shall deal with the transitions of an 
electron and will use the abbreviations of (17a ), the modifications for a 
hole being discussed in Section 17.7. We shall also neglect terms involving 
changes in the effective mass with strain. These terms can be seen to be 
of order P 2 times a function of the strains from curve & C (P) of Figure 17.5 
as follows: The minimum curve for S c for the conduction band corresponds 
to P = (or to a corner of the Brillouin zone instead). Curve & C (P) lies 
higher than this by an amount m n P 2 /2. If the curvature of & vs P 2 changes 
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with strain, the effect is equivalent of a change in m n and this change should 
be linear in the strains for small strains. For such a state we may write: 

&(P,e) = & (P) + SiA + terms in P 2 X strains. (18) 

We shall refer to &\A as the deformation potential. 

(In the diamond structure there are also "optical modes" in which the 
two atoms in the unit cell move in opposite directions. If the energy band 
considered is non-degenerate so that there is only one & function of the 
strain, one can conclude that these optical modes will have no first-order 
terms. This follows from the fact that for the long wave lengths of interest 
in semiconductors, the optical waves are produced by displacing one face- 
centered cubic sublattice in respect to the other, and symmetry permits no 
single-valued function which is linear in the displacement components. 
If the band is degenerate, however, there may be linear terms for the 
individual components 48 ; however, the average interaction will still be 
independent of the optical waves. The pursuit of these problems may 
best be made by the representation theory of the symmetry groups in- 
volved 5 and lies beyond the scope of this book.) 

We shall next prove the basic theorem of the Bardeen-Shockley method: 
The matrix element for ^i is equal to the matrix element of the deformation 
potential, provided that we deal with electron transitions involving small 
values of P and P . The integral which we must evaluate to prove the 
theorem is obtained from (9) by integrating over the ^'s: 

(P',n\^ 1 \P,n± 1) = (n\q c \n ± \)J ^ P .*(r)m c (r)^ P (r)dF T 

+ (n\q 8 \n ± \)j^ r *ir)^ s (r)-if P {r)dV T (19) 

where we have omitted the subscript * for ka, and have reversed the order 
of P and P' , both for the purpose of simplifying the following manipulations 
in which the ty on the right side is chiefly involved. 

The two integrals of (19) are very similar and we shall consider the first. 
In it Gilc corresponds to a perturbation in potential due to a deformation of 
the form 8/?; = Gt) cos k ■ Ri where 77 is a parameter which we shall let 
approach zero so as to obtain the effect of d/dq c . We shall thus consider 
7j to be a small quantity and neglect its square in subsequent expressions. 
The value of G U C is evidently d^i/dr; as 77 — > 0. The deformation itself is 

4a J. Sardeen has proposed that the "optical modes" may play an important role in the 
absorption of infrared photons by conduction electrons or holes, Phys. Rev., 79, 216 (19S0). 
E. J. Ryder and W. Shockley have also concluded that the "optical modes" are important in 
accounting for the energy losses of electrons in high electric fields, Phys. Rev., probably late 
1950. 

6 See, for example, F. Seitz, Phys. Rev. 73, 549-564 (1948). 
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defined only in terms of the nuclear motions but, for the relatively small 
fc-values and long wave lengths of interest in the excess electron case, the 
deformation varies slowly from one unit cell to the next and we may define 
a continuously varying dilatation 

A(r) = V • Gi\ cos k ■ r = — ijG • k sin k • r. 

We evaluate the integral by considering a crystal in which such a deforma- 
tion is actually present with 7? having such a small value that non-linear 
terms can be neglected. We shall then prove by an approximate method 
that 

/V(OA(r)+F(W = ^/exp [i(P' - P') ■ r/h]S,^{r)dV r 

+ terms of order r/(P 2 or P' 2 )/m. (20) 

Hence for small values of P and P', the matrix element may be evaluated 
simply in terms Si just as if the perturbation in potential were b^ir) — 
§iA(r). This proof involves considering the exact wave functions in 
homogeneously strained crystals. 

At any point in the crystal deformed by the ij-wave, there is a state of 
strain called ije(r) where s is symbolic for s xx , Sj, w e M , etc., together with any 
bodily translation and rotation of the material. The unit cell at r a is then 
deformed by »?e(r ). We shall define a wave function appropriate to this 
cell by imagining the deformed cell to be periodically repeated in all direc- 
tions to make a lattice of homogeneous strain in which the cell at r a is one 
cell. The proper wave function in such a lattice would be 

v|/p[r, ue(r„)] = e"" ' r/ *u,[r, *(r )] (21 ) 

where for small deformations \x P [r, rjt(r a )] will differ in cell r a only very 
slightly from u F (r — 8.R a ). In the cell at r a the potential is %>(r) + 77 G lt c (i r ) 
and within this cell \|/[r, J?e(r )] satisfies the wave equation 

[(l/2m)p 2 + %(r) + i,6U e (r)]i|r,,[r, ije(r.)] m [3C r + v <^ c (r)]^ P [r, *(r.)] 

= [& (P) + SnAOvOhM'. *('.)] (22) 

where we have neglected terms, like those of equation (18), of the order of 
riP 2 /m n which arise from changes in the effective mass m n with strain. It is 
to be noted that in the first line of (22) p 2 operates only on r and not on r a . 
This last equation is one of salient points towards which the attack of the 
last section and this has been directed. It is to be noted that ^(r) is a 
real function of position and that in the physical reasoning associated with 
the form S + &iA we have dealt only with such real functions. We shall 
later combine the g&c and ''U, terms and obtain an expansion involving the 
complex factor exp (ik • r) ; this procedure will be used, however, only 
after the integrals of ^ and ^ are individually considered. 
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We next define a function which takes account of the variation in e(r). 
This function is defined as 

t,|r, ? (r)] = e iP " / %[r ) ^)] 

= i|r,(r) + *|»,(r). (23) 

This function changes its form in every unit cell to conform to a periodic 
structure made by repeating that unit cell; in fact it changes its form 
slightly even in the unit cell itself because s(r) varies slightly throughout 
any unit cell. The term 5\|/ is evidently linear in 17 for small 77 and we may, 
therefore, neglect its product with other terms linear in rj. 

We prove the desired theorem of equation (20) by considering the integral 

/ = j ^ r *(r)[<X r + i^cOOhMr, v*(t)Wt. (24a) 

The result of the operation on the right-hand \|/ gives 

/ = ftyr*[S (P) + & lV A(r)]ty P + ^ P )dV r 

, . du P [r, i?e(r)] , N rnt . x 
+ terms involving rjVzyr). (24b) 

These last terms can be shown to be of the order rjP 2 /m, some of the steps 
in the proof being indicated in problem 2. Since the \|r's are orthogonal, 
and rjdtyp (being proportional to t? 2 ) is negligible we obtain: 

/ = ^ fty P .*Ay\tpdF r + S (P)fy\tpr*8i\tpdF r . (25) 

We now evaluate / again by expanding the last two factors and using the 
fact that \\tp and \\ip- are orthogonal eigenfunctions of 3C r . This gives, up 
to terms in ?j 2 , 

I = ftyp,*r,^ c 4fpdV t + Jylfp^WpdV, 

= r, J^ P ,*^pdV r + & (P')f^r*^pdF r ; (26) 

the second step involves using the Hermitian property of !JC r . (See 
problem 3.) Solving (25) and (26) for the desired integral gives 

+ [&o(P) - &o(P')] f typ-*typdK. (27a) 
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The last term is of the order t\P 2 /2m n and can be neglected. The desired 
expression for the integral of ^ is then given by dividing (27a) by rj: 

J ' typ'^^pdV, = ^J^p^pMV,. (27b) 

This result shows that the interaction integral is just what would be obtained 
by introducing a deformation potential d>iA, as was done in Chapter 11. 

We shall next combine the 6 U (; and ^ terms of (19) in a form similar to 
that for 82?i in equation (2) of Section 17.5. We also interchange the 
positions of P andP in the equations to be consistent with earlier notation. 
Equation (19) may then be rewritten as follows: 

(P,»to| 6 Ui|/ , ',»te + 8» te ) 

= Si J \|/p*»|/p'(w* a |V • G ka (q c cos * • r + q, sin k ■ r)\n ka + Sn ka )dF r 

= SiG te • kjlfp^p. exp (iSn ka k ■ r)dF T [h(2n ka +l+dn ka )/SM Uka ]^ 

(28) 

where the first equality comes from writing the dilatations for the q e and q, 
waves as divergences and the second equality from the form of the matrix 
element for q c cos k • r + q s sin k- r which is obtained from (20) of Section 
17.5 by replacing i?» by r. Some factors with absolute value unity such as 
* and i 2 have been omitted; these omissions are unimportant, however, 
since only |(28)| 2 occurs in the transition probability. 
The integral in (28) vanishes unless 

P = P' + hk5n ka 

in agreement with equations (14) and (15). When the integral does not 
vanish, its value is 

jJu P *{r)np,{r)dV T (29) 

where, it will be recalled, Up is normalized per unit volume. For a non- 
degenerate band and small values of P, u P (r) may be written in the form 

Mr) = UoOO + PxUxfj) + PyUy(r) + P 2 u 2 (r) 

where the u's may be chosen to be orthogonal, and from this the reader 
may prove that the integral (29) is unity up to terms of the order of P 2 . 
Thus we obtain finally 

(P, Htal^ili'', n ka + 8n ka ) = SjOfc, • k[h(2n ka + 1 + 5» fta )/8Mo fta ] H (30) 

for the allowed transitions. The allowed values of bn ha are ± 1 [see equa- 
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tions (16) to (21) of Section 17.5] and M is one half the mass of the crystal 
[see equation (11) of Section 17.3]. This formula is equivalent to that 
appearing in the literature except for the interpretation of Si. 6 

17.6b. Final Expression in Terms of Elastic Constants. For the waves 
of interest in semiconductors, the hk values are small compared to h/2a, since 
the changes in P are small, and consequently the wave lengths of the 
ft-waves are large compared to a. For these waves the solid may be 
treated as a continuum so that macroscopic elastic theory may be used. 

As has been evident from the treatment in deriving the matrix element, 
the important waves are those which produce dilatation and have a large 
component of their unit vector G ka parallel to k. In an isotropic solid, the 
three possible polarizations for long wave-length waves consist of a purely 
longitudinal mode and two purely transverse modes; and in a cubic crystal 
the same situation is true for waves propagating in the [100], [110], and 
[111] directions, which lie along symmetry axes. For other directions 
there is a slight mixing of longitudinal and transverse polarizations. For 
germanium the elastic constants and sound velocities are given in Tables 
17.1 and 17.2. [For data on Si see Phys. Rev. 83, 1080 (1950).] 

Table 17.1 Elastic Constants or Germanium 
(dynes/cm 2 ) 

en = 1.292 X 10 12 c n = 0.479 X 10 12 c u = 0.670 X 10 12 



Table 17.2 Velocities of Elastic Waves in Germanium 
(cm/sec) 



Polarization 


Direction of Propagation 
(Parallel to k) 




[100] 


[110] 


[111] 


Longitudinal 

Transverse [001] 
[110] 


4.92 X 10 6 (a) 
3.54 X 10 5 (d) 


5.39 X 10 6 (b) 

3.54 X 10 6 (e) 
2.75 X 10 5 (f) 


5.54 X 10 5 (c) 
3.03 X 10 5 (g) 



The formulae for pc 2 are: 

(a) en, (b) (fu + c n + 1cu)/1, (c) (<m + lc n + ic u )/3, (d) cu, 
(e) C44, (0 ('a - cu)/2 (g) (di ~ c n + cu)/3 

p = 5.35 gm/cm 3 as deduced from the lattice constant. 

Data from W. L. Bond, W. P. Mason, H. J. McSkimin, K. M. Olsen, and G. K. Teal, Phys. 
Rev., 78, 176 (1950). 

6 5>i is equal to C as used by A. H. Wilson, The Theory of Metals, Cambridge University 
Press, 1936, and Si = f C for C as used by A. Sommerfeld and H. Bethe, Handbuch der Physik, 
Vol. XXIVi, and as used by F. Seitz, Phys. Rev. 73, 5*9-564 (1948). 
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For other directions of propagation, the longitudinal velocity will lie 
between the extremes at [100] and [111] and, although the waves will not 
be perfectly longitudinal, the value of (G ka ■ ft) 2 , which enters the transi- 
tion probability, will be nearly equal to ft 2 and since 

(G« • ft) 2 + (G k2 • ft) 2 + (G ft3 • ft) 2 = ft 2 , 

only very small contributions will be made by the chiefly transverse waves 
to the (G ka ■ ft) 2 terms. We see, therefore, that the customary approxi- 
mation of treating a cubic crystal as isotropic and using only the longi- 
tudinal waves for scattering will introduce only relatively small errors. 

Since the longitudinal velocities do not differ greatly in different 
directions, we shall introduce an average longitudinal elastic constant en- 
For cu we take arbitrarily the value for a longitudinal wave in the [110] 
direction, this value being intermediate between the extremes of [100] and 
[111]. We also introduce an average velocity c = (cu/p) y2 . 

If we neglect the transverse waves, we see that, for each transition 
P toP', orP to Pj in the notation of Section 17.2, there are only two pos- 
sible changes in the phonon field given by 

hk=P' -P for 8n k = +1 

hk=P -P' for 8n k = -1. 

Thus, except for this ambiguity, the end state is uniquely determined by 
the value of P' . We shall shortly discuss how the two matrix elements 
can be treated as the single matrix element j G li y | 2 of Section 17.2. 

In terms of c u , the matrix element (30) can be put in a form which is 
more easily remembered. For this purpose we shall introduce the average 
energy (S te ) of the normal mode before and after collision, where {& ka ) 
= hw^iln^ + 1 + hn^/T. = hw^n^ + \ + nj + |)/2. In addition 
we note that (I) M = M c /2 = \ P V where p is the density of the crystal 
and V its volume and (II) co^/ft = c^ = Vc a /p where c^ is the speed 
of wave ka. In terms of these relationships, equation (30) leads to 

|(P»jJ%|j"»ita + Sn ka )\ 2 = &i 2 hw ka (2n ka + 1 + bn^/Wcu 

In this expression (A 2 ) is the average squared dilatation associated with 
energy {&)«,) and is defined as follows: For a mechanical wave, the energy 
is, on the average, equally divided between potential and kinetic energies. 
The potential energy density is %c u (B hk ) 2 where s kk is the longitudinal 
strain and is equal to A the dilatation for a longitudinal wave. Equating 
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the average potential energy to half the total energy gives 

which leads to the last equation above. 

The result may be put in words as follows: The matrix element squared 
is obtained by computing the mean squared dilatation associated with the 
energy in the running wave averaged before and after the collision. This 
mean squared dilatation (A 2 ) corresponds to a mean squared deformation 
potential 6>i 2 (A 2 ) and the matrix element is half this latter value. Stated 
in this form, the result will apply to waves whose polarization is only 
partly longitudinal, as the reader may verify. 

In Section 17.2 we used the approximation that the change in energy of 
the transition P — > P' could be attributed entirely to the changing energy 
of the electron. This can be shown to be a good approximation for the 
conditions of interest by considering first the requirement that energy be 
conserved in the transition so that the change in &(P) equals ±ha ka . 
For the spherical approximation with &(P) = P 2 /2m, we may write the 
energy in terms of the change S\P\ in the form 

5g(P) = |/>|«|j»|/»i = \v\8\P\. 

For the lattice wave we have a change in energy of 

Tko = h2rc/\ = hc\k\ = c\P' - P\. 
Equating these two energies gives 

8\P\ = A|P'-/ , |- (32) 

\ v \ 

At room temperature, |t>[ is about 10 7 cm/sec whereas c is about 20 times 
smaller. Thus the changes in |P| will be only a few per cent of |j°|. Figure 
17.6 shows the end states P' resulting from P. For large values of \P — P'\, 
k is large and the two surfaces are separated so that 8&(P) = 2fkc k . If 
the value of P deviates from one of the surfaces, the change in total energy 
will vary owing to & (P) as V ■ 8P and owing to hw ka as cX (component 
of 8P parallel to k). From this we see that the changes in energy arise 
largely from changes in \P\. Since the separation of the two surfaces is 
only a few per cent of \P\, we may take the two surfaces as being a com- 
mon sphere. For very low energy electrons, however, |v| will be com- 
parable to c. Under these conditions the two surfaces take the form 
shown in (b). For still lower energies, the surface takes the form (c) 
and no transitions in which the electron loses energy are possible. These 
interesting cases are of importance only at very low temperatures and lie 
outside of the range of phenomena with which we are concerned. 

7 See F. Seitz, Phys, Rev. 73, 549-564 (1948) for a farther discussion of these points and a 
list of references. 
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Accordingly we approximate the end states fori*' by a sphere. Further- 
more, two independent processes, corresponding to 8n = +1 and hn = — 1, 
contribute to the transitions from P to P' . These two probabilities should 
be calculated separately using the absolute values of the squares of the 
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Fig. 17-6— Initial and Final Values of P for Transitions which Conserve Energy. 

(a) The two nearly spherical sheets corresponding to emission and absorption of a 
phonon with |Po| = 6mc. 

(b) The two sheets for an electron with |P | = 3wf. 

(c) For an electron with |P | = mc, energy loss is impossible. 

(d) Case of P = 0. 

matrix elements and then added. The result will be the same, however, as 
that obtained by simply adding the squares of the absolute values of the 
two matrix elements. The effective value of |% | 2 used in Section 17.2, 
in which P and P' are replaced by P and Pj, is thus 

io| 2 = | (iV J %!/>*„ + 1)| 2 + ItoaL^hlilPya-*. - 1)| 2 



N 



-Si 2 



2^fa*(»*a + 1) + ho}_ ka t1_ ka 



IVci 



(33) 



|%o| 2 = 5 1 2 ^- = Si 2 <A 2 )-^; A = -±- (34) 
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Since the waves ka and — ka have the same frequency, the two w's will be 
equal. Furthermore, the energy of the phonons for the important transi- 
tions is of the order of hoi = hc2ie/\ = hc\k\ = c\P\ and, since this is much 
less than |v| |P| = kT, the lattice waves of interest are well above their 
characteristic temperature and consequently 

ftwjteCw/w + h) = kT. 

Using this equation to eliminate (2»k, + 1 ), we obtain the expression used 
in 17.2 except for the change in notation, with P,P' in place of Pq, Pj of 17.2: 

kT e2/,2x AT Si 2 * 

— = Si 2 <A 2 > s — ; A = — 
Vcu V re i 

where A is independent of the temperature. The value of (A 2 ) corresponds 
to an elastic energy of kT/2 in form of cuz kk 2 /2 where the use of cu for all 
directions of propagation follows from the isotropic approximation. Equa- 
tion (34) is twice as large as (32) as a result of combining phonon absorp- 
tion and emission transitions. 

Inserting this in the expression for the mean free path derived in 17.2 
gives 

1 = ATm n 2 = & ln 2 kTm n 2 
I(T) wh* rh*cu 

where we have replaced 6>i by &i„ to remind the reader that this formula 
was derived for an excess electron. In the next section we shall show that 
the formula for a hole differs only by replacing S in by S lp and m n by m v . 

Problems 

1. Calculate the electrostatic potential variations in the crystal due to a 
?std wave of long wave length. 

Consider the potential WoOO • 8-Ro- This is the change in potential in 
the crystal due to giving nucleus zero a vector displacement 8/?o. Of 
course, W will be largest in cell zero where the displacement occurs. At 
large distances from the atom, the nuclear motion alone would be equivalent 
to producing an electric dipole of magnitude -\-ZehR . This dipole is 
largely neutralized by compensating shifts of the core electrons, Z — 4 in 
number, and the valence electrons which are 4 in number. The core 
electrons will move by almost exactly the same amount as the nucleus and 
will reduce the moment to 4«8/? - If there were no interaction between 
one and another of the four covalent bonds around the nucleus, the valence 
electrons would exactly cancel the remaining AehRo; this follows from the 
fact that the two bonding electrons between atom zero and its neighbor, 
atom one, have their center of gravity midway between the two atoms 
and will thus move half as far as atom zero and produce a dipole moment 
of 2( — e)^hRo = — ehR ; consequently, four covalent bonds would just 
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cancel the 4e8R due to moving the core. There will, however, be interac- 
tions among the bonds so that if atom zero (say the central dark atom of 
Figure 1.3) moves directly towards one of its neighbors (say the atom in 
Figure 1.3 nearest the reader), it will tend to decrease the angles between 
the bonds to the three atoms it moves away from and this squeezing 
together of the bonds will cause, through the exchange effect, the electron 
density of the three bonds to shift by less than %&R . Consequently, a 
very small net dipole moment, denoted by e'&R where e' -C e, will be left 
over because of the nuclear motion. 

Suppose, therefore, that at large distances from R , $R ■ W (r) is the 
field due to a dipole e'hR , embedded, of course, in medium of dielectric 
constant k. Then the wave 8i?, = Gq c cos k ■ R t gives rise to a wave of 
polarization. Show that this polarization produces a potential 

<p(r) = (Aire'N s /KVk 2 )(k ■ G)q c sin ft • r 

= -(WiV s //c^ft 2 )A(r) 

where NJV'is the density of nuclei per unit volume and A(r) is the dilata- 
tion of the wave. Show that for the wave whose k will reverse an electron 
of energy kT at room temperature in germanium, <p is equal to: 

<p = 5.1 X 10 -2 (//<?) A absolute e.s.u. volts 

= I5(e'/e) practical volts. 

Since e is probably only a few per cent of e, this term is negligible compared 
to the terms considered for the slope of the bands. However, k 2 varies 
directly as T, the absolute temperature, so that this term might be impor- 
tant at temperatures of liquid hydrogen or below. 

2. Show that the terms in du P (r,e)/ds are of order i)P 2 /m. (Equation 
(24b) of Section 17.6.) 

First note that the terms involving (d/ds) are multiplied by Vs and that 
Vs °c k and that hk and P are of the same order. Hence note that the only 
terms which do not lead directly to order r)P 2 have factors like 

d 2 

and that these lead to integrals similar to those for p in the expression for 
v = V P tb(P) and hence to terms no larger than i)P 2 /m n . 

3. Prove that 
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by considering the periodicity of 5\\ip and the effect of this upon integrating 

by parts over the volume of the crystal. 

4. In equation (26) of Section 17.2 the effect of the states into which a 
state bj might decay was replaced by a term — ihrijbj which leads to an 
exponential decay of the form exp (—nji) for bj. Although this pro- 
cedure is correct, as shown by equations (11) to (17) of Section 17.2, 
when a single initial state decays to a set of final states, it requires scrutiny 
when a group of initial states decay to the same set of final states. We 
shall use the coefficients b for states to which a can make transitions and c 
for those to which the ^-states can make transitions. Then it is evident 
that there are four classes of c-states: (i) those having two more phonons 
than «o (corresponding to two transitions with phonon emission) ; (ii) those 
having one more phonon of one type and one less of another type; and 
(iii) those having two less phonons. Consider a state of type (i) with 
coefficient denoted by c (ft, ft') which has one more phonon each of type ft 
and type ft' than «o- This may arise in either of two ways as may be seen 
from the following scheme: 

«o Kk) *(*') c(k,k') 

P P -hk P -hk' P -h(k + ft') 

n k n k + 1 n k n k + 1 

n k > n k > n v +1 n k> + 1 

Each column above describes a state and the state c(k, ft ) may arise 
either by 

ao-+i(k)-*c(kk') 
or by 

«o -» b(k') -» c (ftft'). 

This shows that the decay of any state b{k) into a set of states c{kk') may 
be influenced by the effect of other b(k') which may also decay into the 
same set of states. If this effect is sufficiently pronounced, it will mean that 
errors may be introduced by assuming that each b decays with its own 
characteristic r\ as expressed in equation (26) of Section 17.2. 

The problem left to the reader is as follows: The important £'s and c's. 
are those for which the energy of the system is the same as for state «o> 
all other b's and c's being relatively small. Each b (ft) may be said to 
excite its own set of c(k, ft') which cause it to decay. Show that any 
important c(k, ft') arises in general chiefly from only one £(ft) and that the 
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contribution from the one other possible b{k') is small. Hence show that 
the c's which make any one b(k) decay arise chiefly from that b(k), so that 
the effect of the other b(Jk') upon the decay oib{k) is negligible. Show that 
similar conditions apply to (ii) and (iii). 

This reasoning indicates that the decay of any b may be represented by 
a term like ih-ijb in (26) of Section 17.2 and hence that the equivalence of 
the classical and quantum collision theories can be extended arbitrarily far. 

5. Consider the allowed transitions as represented in Figure 17.6 for 
cases in which \P\ is comparable to mc. Show that, for P§ initially in the 
jy-direction and for collisions with P' in the P x P y plane, the curves of 
Figure 17.6 are the solutions of the equation 

± C \P'-P \=±(P'2-P ( ?) 

2m 

where hn = =F 1 and that these are intersections of a parabola of revolution 
S = (P x 2 + P y 2 )/2m with a circular cone having its vertex on the parabola 
at P y = |Po|j Px = 0, S = P 2 /2m. For values of?' along the P y axis 
show that there are two solutions, one of which always corresponds to 
phonon absorption with 8n = — 1, and that the other changes from phonon 
emission to phonon absorption when \P \ = mc, corresponding to tangency 
between a line of the cone and the surface of the parabola of revolution. 
Verify from this that the curves of Figure 17.6 are qualitatively correct. 

17.7 THE EFFECT OF THE PAULI PRINCIPLE UPON THE 
MATRIX ELEMENTS 

In this section, we shall investigate the changes in the scattering theory 
which are required when many electrons are present. We shall treat first 
the simplest case, that of a single hole, and show that it reduces to a form 
substantially equivalent to that for an electron. On the basis of the 
manipulations used in establishing the transition probability for a hole, 
we shall in Section 17.7(b) make a generalization to the case of a partially 
filled band and obtain results which form the basis for the treatment given 
in Section 11.2. 

17.7a. The Case of a Single Hole. For the case of a single hole, we con- 
sider the set of 2N wave functions constituting the valence band and repre- 
sent them by the following expressions (which are slightly modified from 
previous sections in order to avoid three levels of subscripts): 

X(«; q c ) =ty(P a ; r c )y a (sc), (1) 

where q c = (r c , s c ) represents the position vector r c of electron c with 
respect to the center of gravity of the crystal (not including the mass of 
the electrons under consideration) and s c the spin. The index a runs 
over the basic lattice of P values twice, once for plus spin with y a = & 
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and once with minus spin with "y a = p, a and p being the spin functions 
denned in Section 15.7. When only the functional form of x need be 
considered, x is written as x («)• (We shall use indices a, b, c for electrons, 
reserving i and j for atomic vibrations.) The definition of tJ/(P ; r) = 
typ a ( r ) S' ven in equation (3) of Section 17.6 normalizes \{a; q). As 

for Section 15.7, we mean by I dq integration over volume and sum over 

spin. 

The electronic wave function for the electrons must be an antisymmetri- 
cal function; and since only one hole is present, it can be characterized 
by the missing wave function, as was done in Section 17.8 in connection 
with equation (11). We shall write this wave function as follows: 

A (<Ze's) = . l Zm<&) 'Vx^i %c) (2) 

V(2N - 1)! c=1 

where e c includes all x's except x(«), and £{S) means the sum over the 
permutations 3. of the 2N — 1 sets of coordinates in the determinantal 
wave function discussed in equations (7) and (8) of Section 15.7. 

The wave function for the normal modes of vibration may be taken to 
be that used in the last section [equations (4) and (5) of Section 17.6 or 
(22) of Section 17.5]: 

<|>(«'s) = II §{n kiaj , n-kiaj', Viatic qkiajs) 

= n<j>,. (3) 

i 

As in the last section, each elementary process will be seen to involve the 
change of only one of the n's. (We shall use ('s) to indicate complete sets 
of w-numbers and the complete sets of coordinates.) 

The wave function, except for the motion of the center of gravity of the 
entire system, is thus 

*(/»„, »'s) = AJ% = A a <t>(«*s). (4) 

i 

This is an eigenfunction for the Hamiltonian 

Xo = £DC„ + Z3Q (5) 

c i 

where the 3C,- terms are the harmonic oscillator Hamiltonians and 

3C r . = (l/2m)p c 2 + %(r c ). (6) 

For the case of an almost filled band, each % term must include the aver- 
age effect of all the other electrons in the band as well as the potential due 
to the nuclei, core electrons, and valence electrons in other bands, if any. 
Thus one term omitted from 3C arises from variations in ^ofo) from its 
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average value due to fluctuations in the positions of the other electrons. 
We shall refer to this term as e U 2 . The exact potential energy for all the 
electrons can then be written as 

Glifo's, qi's) = E%(r c ) + E^l(r c , qi's) + ^(Tc's) 

c c 

= E%(r c ) + %r(r c 's, q/s) (7) 

c 

where "^Uir represents the total perturbation energy. The pair of terms 
^ofo) + ^lfo, qi's) represents the potential energy of electron c due to 
interaction with the nuclei and the average positions of all other electrons. 
This gives to ^o and ^i the same meaning as that introduced in Section 
17.4, except that we shall now consider that some of the electrons whose 
average behavior is included in these terms may also undergo transition. 
G \l 2 represents the additional effect of fluctuations in the instantaneous 
positions of the electrons in the valence band. Contributions to ^x pro- 
duced by displacements of the nuclei will include shifts in the average 
charge density of the valence band electrons, which are bound to the 
nuclei and whose wave functions deform as the lattice vibrates. (See 
problem 1 of Section 17.6 for a discussion of such deformations.) The 
^i terms are the important terms and we shall show that for the case of 
one hole, no transitions are produced by the perturbation 6 U 2 - 

(Another term which has been omitted from tJC arises from the inertial 
interaction between the electrons in the reduced mass treatment. This 
is negligibly small and is disposed of in problem 1.) 

We shall now calculate the matrix element connecting two states 

* = AJ% and #' = AOI<(>/. (8) 

i i 

The perturbing energy is 

%r = E%0-o qi's) + silaCr/s) (9) 

c 

and the matrix element is 

(i > a » , s|% T |P h «"s) = J*** G U ir $V(allg's and q's) (10a) 

= jA a *^*(n's) G li 1T A b ^(n"s)d (all q's and q's) (10b) 

= W i ^7!/[ L{2i£(a) ? x( ^ ; ^ ) ]^ 



X 
X 



X{S}e(S)n X (/- d ; Sqjln+j 1 

d J J 

E^ifo, qi 's)+ GUafo's)! d(n\\q's and q's) 



(10c) 



17.7a] 



THE MATRIX ELEMENT FOR A HOLE 537 



where a is omitted from the e c series of x's in the first bracket and b from 
f d series in the third bracket. In this formidible expression, almost every- 
thing integrates either to zero or unity. 

First we shall eliminate ^ ; evidently ^ will give zero if any n changes. 
Furthermore, it will be zero unless Y.P is the same when summed over all 
states in A and A ft ; if the sums are different, a change in coordinates 
made by displacing every electron by a period R{ of the crystal lattice will 
multiply each wave function A by exp [i(Yf) ■ Ri/ti], but will not affect 
6 ll 2 - Because of the periodic boundary conditions, this displacement 
must not change the integral; hence if A *A 6 is changed by the /?, dis- 
placement, the integral must be zero. For wave functions corresponding 
to one hole, however, £P is simply — P„ for A a and — P b for A b so that 
unless P a = Pb the % term vanishes. If P a = Pb, there is, however, no 
transition since the n's cannot change. Thus all that ^ does is to affect 
the average energy of the state and this effect, presumably, has already 
been included in ^ofo)- Hence we can forget G U 2 for the case of one hole. 

Next we shall show that each of the ^lfo, q/s) terms makes the same 
contribution. This result appears obvious from symmetry considera- 
tions, since all electrons are treated alike in the wave functions A a and A;,. 
It may be established rigorously by considering the contribution of any 
representative term in (10b), for example, ^iOv, q/s). We shall show 
that the contribution will be the same as that of % (r g , q/s). To do so we 
first simply relabel q f = (r f , s f ) and q g = (r g , s g ) by interchanging the 
subscripts. This does not affect the value of the integral since in both 
cases r f and r g are integrated over the interior of the periodic crystal with 
volume V = A x A y A z and the spins are summed over ±|. The integral 
obtained from %(//, q/s) by the interchange differs from that due to 
%(/!»?/s)> however, in having q f and q g interchanged in both A tt * and 
A b ; since both of these interchanges multiply the integral by ( — 1), the 
integral from %(»/, q/s) may be made identical with that from ^(jg, q/s) 
by interchanging q f and q„ in A a * and A 6 . The same reasoning shows that 
all ^i terms give the same value. 

We may, therefore, consider only one term in the ''UiO'e, q/s) series, say 
qi^n, q/s) and then multiply the result by 2N - 1. There are (IN - 1) ! 
products in the sum in the first bracket of (10c) and {IN - 1)! products 
in the second bracket, so that ^i^i, q/s) is involved in (/IN - I)! 2 
terms altogether. Since the x' s are an orthonormal set, the integral 
over q 2 • • • ?2At-i will vanish for each of these terms unless the two sets 
of x*'s and x's (each 2N — 2 in number) in which these q's occur in 
(10c) are identical and arranged in the same order. When they are in 
the same order, the integral over these q's is unity. The only set of 
X's common to A 3 and A b is that obtained by eliminating x(«) and xW 
from the total set of 2N x's. Hence (10c) vanishes unless x*(^! .?i) 
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occurs in the first bracket of (10c) and x( a > tfi) ln the second. It still 
vanishes unless the permutations 2 and § are such as to pair properly the 
2N — 2 remaining q's in the common x's sets. There will be, of course, 
(2N — 2)! ways of choosing simultaneously 2. and S so as to do this; 
and for each of these s(2.)e(S) will have the same value (a result which the 
reader may verify); this value will be either +1 or —1 depending upon 
the number of interchange necessary to bring qi into \*(b), an d u P on the 
number then required to bring q\ into x( a ) an d m addition to arrange the 
other q's in the x's of (10c) to match the q's in the x*'s. Since the sign 
of the matrix element is not important in the transition probabilities, we 
need not establish rules for determining it. 

We shall now rewrite the matrix element (10) by dropping ^j evaluat- 
ing G lii(gi, q/s) only, multiplying by (2N — 1) to include the effect of 
the omitted ^i's, and multiplying by (2N — 2) ! to account for the differ- 
ent ways of pairing q 2 to q2N—i- These factors give (2JV — l)(2iV— 2)! = 
(IN — 1) ! which just cancels the coefficient in (10c). The matrix element 
thus becomes 

(P a ,n's\ s U 1T \P b ,n"s) 

= ± fx*V;ru Ji)n*i*%(ri, ?.-'s)x(«; r u si)Tfy/<lqid (ally's). (11) 

The ± sign, arising from e(2.)s(§) as discussed above, does not affect the 
transition probability. It may be noted that, in the integration, A„ 
reduces to x*(^) an d &*> t0 X( a )- The interpretation of this interchange 
of a and b is that the transition probability from a to b for the hole is 
identical with that of an electron from b to a. Further inspection of (11) 
shows that it is of essentially the same form as the matrix element treated 
in Section 17.6; if ^i is considered as linear in the q/s, then the same selec- 
tion rules will apply and (11) will reduce to (9) of Section. 17.6 in which 
only one n of the running wave scheme changes. 

There is also a selection rule for the spin. Since ^i does not involve 
the spin, the integral vanishes unless \(a) and x(^) both contain a or 
both contain p. If spin is conserved, the sum over the spin coordinate 
eliminates the spin functions and replaces the x's by the corresponding 
ty(P) = \|fj> functions. 

It is thus evident that the matrix elements for the transitions of a hole 
from P a to Pb are identical with those of an electron from Pb to P a . This 
latter may be calculated by precisely the same methods as those used in 
Section 17.6, the only difference being that wave functions near the top of 
the valence band are used instead of wave functions near the bottom of the 
conduction band. 

Because of the equivalent form of the matrix element, it is evident that 
the methods of Section 17.2 can be applied to the transitions of a hole. 
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This will lead to an expression like (35) of Section 17.6 for the mean free 
path of a hole, except that the rate of change of S„ with dilation will 
replace the rate of change of S c with dilation in Section 17.6, equations 
(17) and (27). The two expressions for the mean free path will thus be 

1 & ln 2 kTm n 2 



l n (T) xft 4 *n 

1 & lp 2 kTm p 2 



(12) 
(13) 



l p {T) rh*cu 

One important difference between the hole transition and the electron 
transition should be mentioned, however. For the electron transition the 
quantity 

P + ftX>*„fe (14) 

is conserved; owing to the interchange of subscripts a and b on the right 
and left sides of (11), the conservation law for holes applies instead to 

-P + hZn ka k. (15) 

As pointed out in Section 15.8, however, the natural momentum to asso- 
ciate with the hole is Pb = —P where P is the momentum of the omitted 
wave function and Pb the momentum of the remaining electrons in the 
energy band. This value of P B is consistent with the picture of positive 
charge and positive mass for the hole. We see now that it is also consistent 
with the conservation of the quantity 

Pb + ft£»*-*- (16) 

This shows that the formal equivalence of the behavior of holes and elec- 
trons can be made complete, so far as scattering and acceleration are con- 
cerned, by making use of P B . Considerations of this formal equivalence 
do not appear to afford any greater insight into the mechanism, however, 
and we shall not develop them further here. 

Before proceeding to the case of many electrons or many holes, it should 
be pointed out that the reduction of the many-electron matrix element 
(10) to the one-electron form (11) did not involve any reliance on the 
specific form of %i, save that the individual terms depended on the co- 
ordinates of a single electron. The fact that ^i was linear in the normal 
modes entered only in showing how (11), or its counterparts in Section 
17.6, lead to selection rules for P and the w/s. Consequently, (10) will 
reduce to (11) for scattering due to impurity ions, lattice imperfections, 
such as dislocations, or any other causes of deviation from the perfectly 
periodic potential. From this we conclude that the scattering of a hole by 
these imperfections will be the same as the scattering of an electron, except 
that wave functions of the valence band rather than of the conduction 
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band will be involved. This fact is used in Chapter 1 1 in a simpler form in 
discussing the impurity scattering of holes. 

17.7b. The Intermediate Case — Many Electrons and Holes. If the 

band is partially filled so that an appreciable fraction of the quantum 
states are occupied, the effect of the Pauli principle in preventing transi- 
tions of electrons to occupied states becomes important. We shall show 
here that the effect of the Pauli principle on the matrix elements is con- 
sistent with the description given in Chapter 11, in which the effect of 
having the states partially occupied was accounted for by reducing the 
rate of electron transitions by a factor 1 — f, corresponding to the fraction 
of end states available. The treatment of this section does not go as 
deeply into its problem as did the treatment of Section 17.2, in which the 
equivalence of the results of quantum theory and the classical model was 
shown with considerable generality by deriving and comparing the time 
dependence of the distribution of end states for the two cases. For the 
case of many holes and electrons, the detail involved in carrying out a 
similar comparison appears prohibitive compared to its value for expo- 
sitional purposes and we shall proceed only to the calculation of the 
matrix elements. From these matrix elements we shall infer the desired 
transition probabilities by applying the formulae which give the transition 
probabilities for one electron in terms of the matrix elements for that case. 
Accordingly, we shall deal with M electrons which may occupy a frac- 
tion of a set of IN quantum states which arise from an energy band, or 
possibly from several overlapping energy bands. The antisymmetric 
wave functions for the electrons will now be 

A = A(«i, • • • a M ; <Zi> • • • ,Qm) (17) 

where the indices a describe the M wave functions of form x(a), see equa- 
tion (1). The order of the indices a\ to a.M in A is immaterial since the 
only effect of altering it is to introduce a factor ±1 in the determinant 
which A represents [see Section 15.8, equation (8)]. 

The wave functions for the normal modes of atomic vibration will once 
more be expressed in terms of the quantum numbers for the running 
waves, equation (3) of this section [or (4) and (5) of Section 17.6 or (22) 
of Section 17.5]: 

<}>(»'s) = II<|>(» faaj .,M_ faa} -; qkicjc, ?fa af «) 

ij 

= 1%. (18) 

i 

The wave function representing the initial state is then 

§ = A(«i, ■ ;a M ;qu- ■ ■ , Qm )4>(»'s) = M- ( 19 ) 
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The perturbation energy will be the same as before and can be written as 

M 

%r =. E %(»•„ ?i's) + Gtlafa, • • • , r M ), (20) 

c=l 

the terms having the same interpretation as before for equation (9). 

The matrix elements of interest will be those connecting two $'s in 
which one or more as change and one n changes by ±1 ; this last selection 
rule evidently follows at once from the linear dependence of ^(Tc, q/s) 
upon the q's. For the case of one hole, it was possible to disregard ^2- 
This may still be done for the purposes of conductivity theory if the band 
is nearly full or nearly empty and the spherical energy surface approxima- 
tion is made; for transitions induced by G \l 2 > there will be, as discussed 
above in connection with (10b), conservation of J^Paf For the spherical 

i 

approximation for the energy surfaces, there is a linear relationship be- 
tween P a and v a , so that 

I>« = (1/*»)EP„ or £»« = (-l/m) P ZP a . (21) 

Hence electron-electron (or hole-hole) collisions, which are produced by 
6 U 2 > do not alter the net motion of the electrons or the current and, there- 
fore, do not contribute to the relaxation of current. [The reader may 
verify that this significance of (21) is not altered by recentering the 
Brillouin Zone at a corner.] If, however, the energy surfaces are not 
spherical, or if the bands are degenerate so that several values of &(P) 
and v(P) must be considered for each value of P, G U. 2 might have an im- 
portant influence. The dependence of mobility upon temperature indi- 
cates, however, that such effects are not significant in germanium and 
silicon nor in ordinary metals and we shall disregard them. 1 

If we can neglect G ll2, considerations precisely the same as those used in 
going from equation (10) to equation (11) may be applied. This leads 
to the conclusion that the matrix element vanishes if more than one of the 
as changes, because if two or more change, then there will be no common 
set of M — 1 x.'s in A* and A' in the matrix element. Hence we conclude 
that matrix elements occur only for transitions for which one of the as 
and one of the n's change. For these the matrix element will evidently 
reduce to form (11) which, because of the selection rule for n, is equiva- 
lent to (9) of Section 17.6 and involves the initial and final wave functions 
for the electron and the harmonic oscillator wave functions for the only 
mode which changes. The allowed matrix elements are thus identical 
with the set obtained by considering one-electron transitions from the 

1 Scattering in metals in which several bands are involved are discussed for example, in 
N. F. Mott and H. Jones, The Properties of Metals and Alloys, Oxford at the Clarendon Press, 
1936, p. 265. See also J. Bardeen, "Conductivity of Monovalent Metals," Phys. Rev. 52, 
688-697 (1937) for a discussion of electron-electron interactions. 
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initial occupied states, represented by the set a\, • ■ ■ , a r , • ■ ■ , cim to a new 
set «i, • ■ ■ , a r , • • ■ , aM in which only one a r changes, and that to an un- 
occupied state of the same spin. 

As was seen in equation (23) of Section 17.2, the probability of transition 
from an initial state to a set of final states is proportional to the density of 
final states and the square of the absolute value of the matrix element. 
Applying this result to the case of many electrons, we conclude that the 
density of available final states for any one electron transition (a r — * a r ) 
will be reduced by (1 — f) since a fraction f of the states which might be 
selected for a r are already occupied. Thus the transition probability 
will be the same as it would be for one single electron in state a r , except 
for the factor (1 — f). This is physically equivalent to the argument 
used in Section 11.2 which stated that the distribution of electrons made 
transitions just as if each electron acted independently, except for the 
restriction that transitions to occupied states were forbidden. 

We have thus justified to a certain extent the use of the classical model 
of Section 11.2. To review and criticize the argument we point out that 
what we have done on the analytic side is to show that the matrix element 
connecting two states of the entire system differing by having one elec- 
tronic state \{a r ) replaced by another (a r ) is identical (except for a factor 
±1) with the matrix element for a system with one electron only in which 
the one electron makes a transition from \{a r ) to \{a r ). (It is assumed, 
very artificially, that the potential energy for one electron is the same for 
both cases.) It also follows that because there are no possible wave func- 
tions in which two electrons are in the same end state, there are no matrix 
elements for transitions to already occupied states. It was pointed out 
that the transition-probability formula involves the density of end states 
so that a factor (1 — f) is introduced to account for the reduction of 
available end states. 

However, formula (23) of Section 17.2 for the transition probability 
was justified by showing the equivalent behavior of the classical and 
quantum-mechanical models. This equivalence was demonstrated for 
the case in which only one excess electron (or hole) was involved. In 
order to establish the same degree of equivalence for the case of many 
electrons, it would be necessary to extend the Weisskopf-Wigner treatment 
to many electrons and, as stated earlier, the increased rigor obtained by 
such a treatment does not appear to warrant the complications involved. 

Problem 1. The reduced mass problem for many electrons interacting 
with the crystal leads to a Hamiltonian with non-diagonal terms in the 
electronic momenta. Let the coordinates of the electrons in respect to a 
fixed origin be r e i, r e2 , etc., and that of the crystal be R c . Then r a = 
r ea — R c is the proper coordinate to use for the position of an electron in 
respect to the crystal and is the r a referred to in this section. Show that 
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the equations which should be used in place of (4) to (7) of Section 17.4 are 

pT = Pc + UPea 

a 

Pa = Pea ~ (m e /M T )P T 

r T = (M c /M T )Rc + (m e /M T )Y,r e a 

fa = Tea ~~ **c 

and that the Hamiltonian becomes 

3C = (\/2M t )Pt 2 + V/2m)ZPa 2 + (l/2Af e )E ZPa-pb + ^fo's, q/s) 

where m = m e M c /(m e + M c ) as before. The effective mass in the 
p a ■ pb terms is so large that it represents a negligible correction. 



APPENDIX A 
UNITS 

(a) The Basic Rule for Changing Units. 1 The problem of changing the 
measured value of some quantity from one system of units to another con- 
tinually occurs. The work can be systematized with the aid of the follow- 
ing idea: Measurement consists of comparing quantitatively the thing 
studied with reference units of length, mass, time, temperature, etc. In 
order to discuss the relationship between measurements made in two 
systems §1 and §2 we shall introduce the following conventions: measured 
numerical magnitudes of scalar and vector quantities will appear in italics L 
whereas symbols representing the units themselves will appear in boldfaced 
roman L. Thus the length of a 12-in. ruler measured in the c.g.s. system is 
written as 

UU (l) 

where L\ = 30.5 and Li = 1 cm. Conversion factors between units will 
also be written in boldfaced roman with double subscripts; for example if 
L 2 = 1 in. we write 

Li = L 12 L 2 , L 12 = (2.54)- 1 . (2) 

Conversion factors between the measured magnitudes of the same quantity 
measured in two units will be written in lightfaced italic with two sub- 
scripts; for the 12-in. ruler we have 

U = £12^2, U = 30.5, La = 12, L 12 = 2.54. (3) 

If one and the same quantity is measured in the same way in two sets of 
units, relationships of the form 

UU = L 2 L 2 (4) 

always hold. From this we deduce that 

Z.12 = L12 = L21 = L21. (->) 

The example given above applies to situations in which the units are 
changed while the measurements are carried out by the same operational 
procedure. Another possibility, however, is to keep the units the same 
while changing the operational procedure. For example, consider a circle: 
If it is an automobile tire, its size might be measured by calipering its 

1 For more detail see P. W. Bridgman, Dimensional Analysis, Yale University Press, 1922. 

545 
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diameter and dividing by two thus obtaining 

(Size) equals L{L\ = 16 in. (6a) 

which measure its radius. Equally well its circumference might be 
measured giving 

(Size)' equals U'U = 100.5 in. (6b) 

It is evident that no simple rule of changing units relates L\ and Z,/; the 
difference between the two arises from a difference between the two opera- 
tional procedures defining size. In the next section we shall consider § 2 > 
the M.K.S. system of electrical units, and compare it with §i, the c.g.s. 
system. For that case the transformation of mechanical quantities (i.e. 
force, velocity, energy etc.) is a straightforward problem in changing units. 
For the electrical units, however, the change is of the form (Size) — > (Size)' 
discussed above since different definitions of the operations used for meas- 
uring charge, magnetic field etc. are used in the two systems. The fact 
that these two transformations are carried out simultaneously is probably 
the principle cause of confusion in seeing the relationships between the two 
systems. 

We shall next derive the formulae for the transformation of measured 
quantities for cases in which the method of measurement remains the same. 
Accordingly we consider some physical entity & such as a solid or region of 
space. We measure in system § x some particular property 9 of the entity 
and obtain a value X\. This result is expressed by writing 2 

entity & has XiWULfti' of 9. (7) 

Example: 

water has 1.0 cm -3 gm +1 of density. (7a) 

If we measure 9 of entity $ in S 2 , we write 

entity & has X 2 L 2 a M 2 6 T 2 c of 9 (8) 

water has X 2 ft -3 lbs +1 of density. (8a) 

The amount of property 9 should be independent of the method of measure- 
ment so that we may write 

X 1 L 1 a M 1 6 T 1 c = X 2 L 2 a U 2 b T 2 c (9) 

1.0 cm" 3 gm 1 = X 2 ft" 3 lb 1 . (9a) 

It is frequently convenient in manipulation of expressions like (9a) in 
order to find X 2 to proceed by multiplying the left side by factors of the 
form 

1 = (LiaLa/LO, 1 = (M 13 M„/Mi) (10) 

where we may use any convenient form of writing unity which accomplishes 

2 We shall use T rather than / for time in this appendix in order to be consistent with L 
and M. The use of M is intended to reduce confusion with m for mass and m for meter. 
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the desired cancellation shown below. This cancellation process appears 
as follows: 

^"(LxaLa/LO-M^fMxaMa/Mx) 6 • • • 

= Z 1 L 12 a M 12 6 • • • L 2 a M 2 6 • • • = X 2 L 2 °M 2 6 ■ ■ • (11) 

_ i i..iT y^. 1^.1111 

l.< 



. . _ 3 Z2.54 X 12 cm\ 3 / 121b \ 

1-0 cm 3 I I gm I — — ) 

V ft / B VlOOOgm/ 



= 1.0 (30.5) 3 ^ ft" 3 lb = X 2 ft" 3 lb. (11a) 

Since the unit symbols are the same, we have 

*iL 12 °M ia 6 ■ • ■ = X 2 (12) 

1.0 X 2.85 X 10 4 X 2.2 X lO" 3 = 62.7 = X x . (12a) 

Sometimes it is convenient to proceed in cascade: The life of a radioactive 
element is 2 years, convert this to seconds. 

365 days 24 hrs 3600 sec 

2 years • — — — 

1 year 1 day 1 hr 

= 2 • 365 • 24 • 3600 sec = 6.31 X 10 7 sec. (13) 

By virtue of the relationships (5), the transformation of measured 
magnitudes can also be written in terms of the Z 12 quantities. Thus if X is 
measured in units of L a M 6 T c , the relationship between Xi and X 2 may be 
written 

Xi = L 12 a M 12 b ...*,. (14) 

A number of the manipulations in the next section will be carried out in 
this form. 

(b) Electromagnetic Units. The electromagnetic units used in the 
text are in 

absolute electrostatic units for E, D, p, q, I (15) 

absolute electromagnetic units for H and B. (16) 

In these units Maxwell's equations and the force equation are 
Absolute c.g.s. (System 1 ) 

VXfli = (1M)[A + 4TA] (a) V X E l = (-1/^)^ (b) 

V • 5i = (c) V • 7>i = 4t P i (d) 

Bi = ^Tfi (e) !>! = Ke E x (f) 

h = 'A (g) *"i = qi[Ex + (1Ai)»i X Bx] (h) 

Cj = 2.998 X 10 10 cm/sec (i). (17) 

For Km — 1, B can be replaced by H in the force equation and the subscript 
e dropped from K e , the specific inductive capacity or dielectric constant. 
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In this system unit electric charges and magnetic poles produce unit field 
at one cm in free space. In terms of these requirements the units of E, H, 
B and D are the same and are M^L^T" 1 and while qi is M^L^T -1 . 
Using these dimensional formulae, equations (17) can be changed to other 
units for mass, length and time in accordance with the basic rule of part (a). 
The M.K.S. electromagnetic system cannot, however, be obtained from 
(14) simply by changing the units of length, mass and time. The reason 
is that Maxwell's equations in the M.K.S. system are not in absolute 
M.K.S. units, which could be obtained from (14) by the basic rule, but 
instead are modified so that none of the units in which the electrical 
quantities are measured are absolute M.K.S. units in the same sense that 
the units of equations (17) are absolute. For the purposes of this exposi- 
tion we shall regard the choice of the M.K.S. system of electrical and 
magnetic units as being dictated by two requirements: First, electrical 
quantities shall be in practical units so that unit charge is the coulomb 
(coul) and unit electric field is the volt per meter; these choices lead to a 
unit of electrical energy equal to the watt sec or joule which is also the unit 
of mechanical energy in the M.K.S. system. Second, the units of D, B 
and H shall be chosen so that the coefficients (1/c) and 4tt in parts (a), 
(b) and (d) of (17) disappear. This is accomplished by the following 
transformation scheme where subscript 2 implies the M.K.S. system: 

Table A.l Conversion between (1) Absolute c.g.s. 

and (2) M.K.S. Systems 



Mechanical 
100 cm = lOOLi = L 2 = meter 



Li = 100L 2 


L n = 10 2 


10 3 gm = 10 3 Mi = M 2 = kg 
Mi = 10 3 M 2 


M12 = 10 3 


Ti= T 2 


Tn = 1 


»i = (Li2/Tii)v 2 


012 = 10 2 



£ 1 = (A/i2^i2/r 12 2 )F 2 - £12 - 10 5 

Coulomb-Volt 
c 1IO -1 stat. coulombs = c ilO -1 qi = q 2 = coulomb 

qi - C ll0 -1 ?2 ?12 = c l/10 

(10 8 /d) stat. volts/100 cm = (10 6 /n)£i = £ 2 = volt/meter 

£1 = (10 6 Ai)£2 £12 = 10 6 Ax 

Additional Conversion Factors 

Pi = q\IVx = {qi2/L n )(.q 2 /Vij P12 = ci/W 

h = qi/L^Tx = W£i2 2 )/2 In = <i/10 6 

0-1 = /l/£l = (IWEu)ff2 <712 = fiVIO 11 

Di = 4xfil0- 6 D 2 D12 = 47T£ilO- 5 

Bi = 10 4 £ 2 B l2 = 10 4 

Hi = 4irl0- 3 # 2 //la = 4xlQ- 3 
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In terms of these conversions equations (12) become: 
M.K.S. System (System 2) 

V XH 2 = D 2 + h (a) V X E 2 = -B 2 (b) 

V • B 2 = (c) V • D 2 = P2 (d) 

^2 = K m fi H 2 (e) Z) 2 = /c c s £2 (0 

/ 2 = ^2^2 (g) F 2 = ?2 [£ 2 + v 2 X # 2 ] (h) 

mo = 47rl0- 7 = 1.257 X KT 6 , l/ Mo = 7.958 X 10 5 

1/4ttmo = 6.332 X 10 5 (i) 

so = lO 11 /^™-! 2 = 8.854 X KT 12 , 1/ao = 1-129 X 10 11 

l/4xs = 8.988 X 10 9 = ^ 2 2 10- 7 0) 

C2 = (eo/*>)~* = 2.998 X 10 8 m secf" 1 (k) (18) 

where K m and k„ have the same meaning and values in both systems. For 
purposes of illustration we shall verify that (17a) transforms to (18a): 

(V X H) x = (H 12 /L l2 )(y X H) 2 = 4,rl(r 5 (V X H) 2 (19a) 

(1/fi)^! + 4x/i] = (1/^)14x^10^^2 + 4x Cl 10- 5 / 2 ] (19b) 

the left sides are equal by (17a) and the right sides are 4u-10~ 5 times the 
left and right sides of (18a) which proves that (17a) converts to (18a). We 
shall also verify that (17f) converts to (18f): 

Z>i = 4«- 1 10- 5 Z> 2 = Ke E 1 = K e {\(f/c l )E 2 (20a) 

so that 

D 2 = K e (10 n /4irc 1 2 )E 2 = K eSo E 2 (20b) 

if so has the value given in (18j). 

The quantities in the M.K.S. form of Maxwell's equations have simple 
units when expressed in terms of the volt, coulomb (coul), sec system: 

Table A.2 Units in M.K.S. System 

qi coulombs Z) 2 coul m~ 2 

Pi coul m~ s Bi webers m -2 = Bz volt sec m~ 2 

h coul m -2 sec -1 Hi amp turns m~ 2 

(T2 ohm -1 m -1 im, volt sec amp -1 m _1 = no henry m _1 

E2 volt m _l So farads m" 



-l 



Unit of force: newton = 10 6 dynes 

Unit of energy: joule = newton m = watt sec = 10 7 dyne cm = 10 7 ergs. 



An alternative method of deriving the relationships of Table A. 1 is to 
consider the repulsion between corresponding ends of two similar thin 
solenoids each carrying flux m 2 = B 2 X (area of solenoid) and thus cor- 
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respond to magnetic poles of strength m 2 . In free space the force is 
required to be 

m 2 2 /4irW2 2 = 10W/16ir 2 r 2 2 - (21) 

This equation fixes m% in an absolute sense just as does the absolute c.g.s. 
system of requiring unit force. From this condition the ratio of B 2 to Z?i 
and to H% etc. may be determined leading to the definitions of Table A.l. 
A quantity of considerable use is the M.K.S. value of cm/^m , the charge 
to mass ratio of the electron where we have introduced the subscript M in 
place of 2 to facilitate comparison with Section 8.8. This has the value 

(e M /m M ) = 1-60 X 10- 19 /9.03 X 10~ 31 = 1.77 X 10 n coul/Kg. (22) 

The acceleration of an electron in a field of Em volts/meter is 

a = (e M /m M )EM = 1-77 X 10 n £ M m sec -1 . (23) 

The circular frequency in a field Bm is found by equating the acceleration 
to the force divided by mu thus obtaining 

a = o; 2 r = (eM/m M )urB M or u = {e M /mM)B M - (24) 

In a field of Bm — 1 or 10,000 gauss, the frequency is 

/ = a /2 r = (e M /m M )/2Tc = 2.82 X 10 10 sec -1 (25) 

which agrees with the value in Fig. 10.2. 

(c) Atomic Units (System 3). Another system of units, due to Hartree, 3 
and frequently employed in calculations of wave function is that in which 
the values of electronic charge, electronic mass and Dirac's h are unity. We 
shall relate this system, called S3, to the c.g.s. system: 

cm = L x = L13L3 = L 13 « = (0.528 X lO - " 8 ) -1 X Bohr orbit (26) 

gm = Mj = M 13 M 3 = (9.03 X lO" 28 )- 1 ^ (27) 

sec = Tj = T 13 T 3 = (2.419 X HT 17 )"^ (28) 

where to avoid confusion with the meter m e is used for the mass of the 
electron and «o is the Bohr orbit. The formulae for a and T3 are 

«o = tfm- l e- 2 = 0.528 X 10~ 8 cm (29a) 

T 3 = ft 3 ^- 1 -? -4 = 2.419 X 10- 17 sec (29b) 

m e = 9.03 X 10" 28 gm. (29c) 

In these units the speed of light has the value 

d cm/sec = c 3 a /T 3 = 137.29«o/T 3 (30) 

3 D. R. Hartree, Proc. Comb. Phil. Soc, 24, 89 (1926). The values agree with those in 
Condon and Shortley, The Theory of Atomic Spectra, Cambridge University Press, 1935. 
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where c% = 137.29 = cihi/e 2 is the reciprocal of the fine structure con- 
stant. The unit of energy is 

(energy)3 = m e a 2 TiT 2 = atomic unit = 27.07 electron volts. (31) 

The Bohr orbit calculation for a hydrogen atom becomes in these units 

§i §3 

fpdq = m e r 2 w2ir = k = 2irh r 2 co = 1 (32a) 

Force = e 2 /r 2 = m e rw 2 rV = 1 (32b) 

so that r — w = 1 is the solution and the energy is 

-e 2 /r + mw 2 r 2 /2 = -1 + \ = -\ or -13.53 electron volts. (33) 

In atomic units the wave equation for an electron in the field of a nucleus 
of charge Z is 

|V 2 \|r - (Z/r) = idty/dt = Si|/ (34) 

for which some of the lowest energy solutions are 

v|/ l8 = (Z^V^e-* (35a) 

\|/ 28 = (Z*V4v / 2^)(2- Zr)e~ Zr/2 (35b) 

y\i 2pz = (Z H /4\/ 7 br)Zre- Zrl2 cos 6 (35c) 

y\i 2px = (Z^/4V2i)Zre~ Zrl2 sin 6 cos * (35d) 

x|r 2P|f = (Z ? V4V2^)Zre- Zr/2 sin cos tf>, (35e) 

the energy for Is being — \Z 2 and for the others —\Z 2 . 
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Holes and electrons, formal equivalence in 

behavior of, 182, 443, 539 
Holes produced by alpha particles, 114 

Impurity centers, ionization energy of, 227 
Impurity semiconductor, 12-15 
Injection of carriers, 55-71, 91, 96 
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capacitance of, 100-101 
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Quantum state, a priori probability for, 
453 
concept of, 128 
Quantum states, 119-154 
conservation of, 132, 150 
distribution in energy of, 220-229, 453 
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Reciprocal lattice, 405 
Recombination-emission center, 345 
Recombination of holes, 67-69, 298, 318-328, 

347 
Rectification: see p-n junction and point 

contact 
Resistivity: see conductivity 
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Running waves, 135-136 

transformation to, 510-516 
Rutherford scattering, 262 

^-states, 455 

Saturation current in p-n junction, 314 
Saturation range of temperature, 244 
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formula, 258-264, 278-279 
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Schroedinger's equation, for eigenfunctions, 
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time dependent, 127, 372 
Selection rules, 516-520 
Shock-wave phenomenon, 332 
Silicon at room temperature, 247 
Spin, 128, 435 

Spreading resistance, 99, 104 
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37 
Stationary states, 382 

Statistical mechanics, relationship to thermo- 
dynamics, 459-461, 475-480 
Suhl effect, 71-74, 318, 325-328 

anomalous, 340 
Surface charges, 29 
Surface decay constant, 322 
Surface recombination velocity, 321 
Surface states, 31-34 

effect on rectification, 96 
Sweeping field, 60 

Thermal agitation, transitions due to, 264- 

270, 43 1-543 
Thermistor, 29 
Thermodynamics, relationship of statistical 

mechanics to, 459-461, 475-480 



Transistor, filamentary, 77-85 

p-n junction, 92-95 

type-A, 34-50, 101-114 
Transistor electronics, applications to, 295- 

349 
Transit time effects, in the filamentary trans- 
istor, 83-85 

in the type-A transistor, 106-108 
Transition probabilities, 189-193 

quantum theory of, 258-264, 487-491 
Transmission line, theory of, 365-368 
Traps, modulating, 345 
Type-A transistor, 34-50 

theory of, 101-114 

transit time effects in, 106-108 

Uncertainty relation, 418 

Unitary transformation, 504 

Units, atomic, 550-551 

basic rule for change of, 545-547 
electromagnetic, 547-550 
M.K.S. electromagnetic, 548-550 

Vacuum tube, analogy to, 94-95 

Valence band, pr valence-bond band: see 

energy bands 
Vectors, primitive, 399, 409 
reciprocal, 401 

Wave equations, 122-127 

Wave function, absolute value and complex 

conjugate of, 371 
Wave functions, analogues to, 123 

antisymmetric, 435-440 

determinantal, 437 

normalized, 369 

orthogonal, 382 
Wave packets, general, 162-166 

group velocity, 410-411 
Wave packets, quantum-mechanical, 

acceleration of, 424-435 

charge density and current density for, 
442-444 

general properties of, 417-424 

in terms of energy eigenfunctions, 444- 
445 

Zustandssumme, 476 



